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ABSTRACT 


The objective of this reeeerch is to nodlfy Fleher'e eenple linear 
dlacrlvlnant function through an appropriate alteration of the connon aanple 
variance-covariance natrlx. The alteratlone conaleta of adding nonnegative 
valuea to the elgenvaluea of the sample variance-covariance matrix. The 
desired results of this modification Is to Increase the nus^er of correct 
classifications hy the new linear discriminant function over Fisher's 
function. This study Is limited to the two-group discriminant problem. 

The present research has Identified several feasible alterations on 
the sample variance-covariance matrix which produce several different biased 
linear discriminant functions. The performance of the biased discriminant 
functions are compared through Honte Carlo experiments. Comparative perfor- 
mance is based on the Conditional Probability of Mlsclasslflcation (PNC). 

Each biased discriminant function has been evaluated over seventy-two (72) 
different computer simulation design configurations which gave consideration 
to: 

(1) Sample slae, 

(2) near-singularlty In the variance-covariance matrix, 

(3) Hahalanobls distance, and 

(4) orientation of mean vectors. 

Initially, it was believed that sufficient Improvement in the conditional 
PMC could be gained by defining a new discriminant function through the dele- 
tion or small eigenvalues (equating them to zero) in the sample variance- 
covariance matrix. However, the difficulty of determining a "cut-off" value 
led the researchers to consider several additional alternations on the sample 
variance-covariance matrix. 
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1 XHTSODUCIiaH 


In nany situations it is necessary to assign (or classify) an object 
into one or two groiq>s under conditions of uncertainty. As an aid in tlu.« 
classification process, procedurea haue been developed whereby an object is 
measured on p variables whose values are believed to be influenced by the 
group to which the object belongs, nieae measurements are compared, in 
some way, with corresponding measures for objects known to belong to each 
of the two possible grot^s under consideration. The object is then assigned 
to the group to which it is most similar; similarity is based on some kind 
of distance function. In this study, that distance function will be called 
a discriminant function. 

IVo of the best known discriminant functions developed to handle classifi- 
cation problems of this nature are Fisher's (1936) linear discriminant 
function (LDF) and the W classification statistics discussed by Anderson 
(1958). Fisher's LDF and Anderson's W give identical results when applied 
to the same set of observations. In fact, one is a linear function rf the 
other. 

In any classification problem, it is desirable to get a measure of the 
chance that an object will be mlsclassified by the discriminant function. 

This measure of mlsclassification is commonly called the probability of 
mlsclassification (FMC). Using Fisher's LDF, one may compute the exast 
probability of mlsclassification if the probability distribution for the two 
populations is multivariate normal with known equal covariance matrices and 
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known anan vectors* However t In practice • the eoanon covariance aatrlx and 
aean vectors are unknown and are obtained by unbiased seaple eatiaetes* Hhen 
saaple estiaates replace the population paraaetera in the U>F (LDF becoaea 
the aaaple linear diacriainant function* SLDF), the exact probability of 
alaclaaslflcatlon becoaea difficult to coapute becauae the distribution of 
the SLDF la virtually Intractable (Lachenbruch* 1975). However, If the 
saaple estiaates In the SLDF are considered fixed, the SLDF has a conditional 
univariate normal distribution, and the conditional probability of aisclasal** 
fication can be counted (under the given fixed conditions). Hills (1966) 
showed that the exact probability of mlsclasslflcatlon obtained froa the LDF 
la always less than the conditional probability of alsclasslflcation computed 
from the SLDF. This study is concerned with the problem of decreasing the 
conditional probability of mlsclasaifylng an observation when fixed estimates 
of the population parcmeters are given. 

Many statisticians have investigated the behavior of the SLDF. The 
exact distribution of SLDF was studied by Wlad (1944), Anderson (1951), and 
Okamota (1963); estimation of error rates was studied by Dunn (1971), Hills 
(1966) , and Lachenbruch and Mickey (1968) ; variable selection was studied by 
Cochran (1964), McKay (1976), McCabe (1975), Habbema and Hermans (1977), and 
Van Ness and Simpson (1976) . Robustness to various departures from assump- 
tions was studied by Gilbert (1968, 1969) and Krzanowskl (1977). Rao and 
Nitra (1971) used the singular multivariate normal distribution to construct 
a discriminant function between two alternative normal populations with 
singular covariance matrices. Recently and more relevant to the present 
work, DiPUlo (1976, 1977) and Smldt and McDonand (1976) showed that estimat- 
ing the population covariance matrix in the LDF with a certain biased 
estimator results in a decrease of the conditional probability of misclassl- 
fication. DiPillo (1976, 1977) used Monte Carlo sampling experiments; the 
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rcaultt of his expsrlaents suggsst that if the population covariance «Btrtx 
is ill-conditioned (its deteminant is near sero), the aasqple covariance 
natrix can also be expected to be ill-conditioned. Therefore, the condition- 
ing of the saaple covariance oatrlx has an effect on the perforaance of the 
SLDF. Prior to DiPillo, Bartless (1939) slaq^ly alluded to the unstableness 
of variable coefficients in the SLOP but did not pursue the problem any 
further. 

Biased estimators have received a great deal of attention in relation 
to regression analysis. For the general linear model, it is veil known that 
least squares methods provide sstimators with minimum variance within the 
class of all unbiased estimators. However, within the last decade, much has 
been written about the application of biased estimators to the linear model. 
Hoerl and Kennard (1970) Introduced a biased estimation procedure known as 
Ridge Regression. Other biased regression procedures are Latent Root 
Regression, Introduced by Webster, Gunst, and Hason (1973) and independently 
by Howkins (1973), and Principal Components Regression, discussed by Hassy 
(1965), Hocking (1976), Mansfield, Webster, and Gunst (1977), and Marquardt 
(1970). Relatively little has been done regarding the application of biased 
estimators to the linear discriminant function. This study is an attempt 
to apply principal component procedures in order to modify the SLDF to 
Include bias. 
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2 BASIC PBOPBRTIES OP THE LINEAR 
DISCRIMIRAMT FDNCTIOR 

2.1 The Population Diacrlmlnant Function 

Let ■ (X • X » . . . , X^) bo a randon vector fron one of two popula- 

tions or * 2 * K denote the domain of the p-dimenslonal vector. It 
la desired to claaalfy X Into one of these populations. The objective In 
devising a rule of classification la to partition R Into R^ and R 2 by soma 
optimum method so that: 

If 2 falls In Rj^t assign the object to 

If X falls In R 2 * assign the object to V 2 

This classification process Involves two kinds of errors* namely* that 
(1) an object Is assigned to population when It really belongs to ^2 

<2) an object may be assigned to ^2 when It really belongs to A good 

classification rule should minimise the probability of these errors In 
classification. 

In order to construct a mre specific characterization of the dis- 
criminant problem* the following symbols are defined: 

f.(^ ■ the joint probability density of elements of X for 
^ population f^ Is assumed to be continuous. 

qj ■ the prior probability of obtaining an observation from v^. 

P(i|j) ■ the probability of classifying an observation into v. when 
It Is really from (1 f* j). 

TP - the total probability of mlselasslficatlon. 
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81ne« la the doaaln for elaaolfying an objact into a obaarvation 
will Hava alaclaBOif leation probability 

P(l|j)-/ (l^J) • (2*1) 

% ^ 

From (2.1), 

TP-P(2|l)q^ + P(l|2)q2 (2.2) 

As Indicated above, a good claaslflcatlon rule la devised when end 
and R2 are chosen such that TP Is minimised. The minimum value of TP will 
be denoted by OPT. Anderson (1958), using an approach Introduced by Welch 
(1939), shoved that 

\ - {X|fi(2)/f2<2> 1 92 /<Ii> <2- 3) 

and 

‘*2^V 

are the regions that minimise (2.2). Actually, ^ most 

appropriately called the likelihood ratio which minimizes the TP. 

No matter what the distribution fj(X) is, statements (2.3) and (2.4) 
Imply the following classification rules for an observation 1^: 

If fj^(^)/f2(i|^) 1 92/92* classify into ty (2.5) 

If f2(Xjj)/f2(X^) < 92/92* classify Into V2* <2.6) 

Now assume that the distribution fj(X) Is multivariate normal. That Is, 
t It) . 1 „ . . ( 2 - 7 ) 
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where j - 1,2, U. le the Bean vector of X In v.* and £. la the varlanee- 

‘T ~ j j 

covariance Matrix of X in With this aasui^tion, an eqaivalwat font of 
(2.5) and (2.6) can he derived. Taking the natural logaritin of both eites 
of f]^(X)/f2^ ■ ^2^^1* obtains 


tn(f^(X)/f2(X)) - (1/2)X'(E2^ - Sj^)! + X* (ij^^ - £ 2 ^ 2 ) 


* in 


' 


- fn ^2^^! 


The second expression in the equalities in (2.8) is called the quadratic 
dlscrlainant function because it is quadratic in the conponents of X. If 
and V 2 ^ oot differ in their covariance sMtrices, that is, if 
■ £2 * (2.8) reduces to 

(X - (1/2) (Uj + U2)l'l“^(S2 - Uj) - to q2^*»l • 

where the left side of (2.9) is linear in the conponents of X. Hence, the 
population linear discriminant function D(X) is defined by 

D(X) - (X - (1/2) (U^ + U2)ri’^(Sl - £ 2 ) 

- X’£“^(U2 - U 2 ) - (1/2)(02 + U2)’l"^(Wl " V * 

The first tern ct the extreae right member of (2.10) is the thMretical 
equivalent of the linear discriminant function proposed by Fisher (1936). 
The expression given by D(X) in (2.10), which is a discriminant function 
used in this study, was denoted by Anderson (1958) as V. 

If it is further assumed that 9^ " *^2 * (2.5) and (2.6) in 

terms of D(X) becomo: 


(2.U) 

( 2 . 12 ) 


If S(X^) 1 0. Msisn Into 

If D(^) < 0, nsaign Into «2 

Roto that tha raglona Rj^ and R 2 art now dafinad by ■ (X|d(X) > 0) and 
^2 " Fron (2.1), it can ba aaan that 

P(l|2) - / f,(X)dX andP(2|D- / f,(X)dX. (2.13) 

D(X)<0 * 

Alao» DQC) la univariata nomal bacauao It is a linear function of conponants 
of the nultivariate nonal vector X. If a transfomation U ■ D(X) • along 
with (p-1) other suitable transfomatlons, is defined, one can see that the 
range of integration in (2.13) dependa only on 0. When the other (p-1) 
variables are integrated out, (2.13) reduces to 

o 

P(2 1) - / N2(U)dU , P(l|2) - / N2(V)dD , (2.14) 

o 

where and N 2 are univariate normal probability distributions of U in 
and « 2 * i^espectively. 

Since 0 - D(X), it is clear that 

P(2 ll) - Pr(0 < 0 |X e Vj) - Pr(D(X) < 0 |x e ir^) 

and 

P(1 12) • Pr(0 > 0 fel e * 2 ) ■ Fr(D(X) i 0 ^ e Vj) 

Purthemore, the aeans of D(X) are, 

E(D(X) fee c »j) - (1/2)(02 - " 

E(D(X) c Wj) ■ (-1/2) (0^ - 2,2>'^"^(2 i ■ £ 2 ^ “ 
and the variance is 


(2.15) 


-D^ , (2.16) 
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V«r(D(X> (X 1 - Var(D(X)| X c Wj) 

- (Oj - • D* ’ 


«h«r« “ £ 2 ^' cttrrant lltcratur«» D ■ 

la called the M^alanobla dlataaea batwaan vactora and £ 2 * 

ly aaklng a cransfocaation froa 0 to T ■ lO ' E(U))/D, tha univariata 
atandard noraal diatribntion la obtaload. Hanca, 

P(2|l) - Fr(U < 0|X e w^) 

(2.1,) 

• Pr<Y < -D/2) 

- P»<T < -D/2) 

- ♦(-D/2) . 

ahare ♦ la tha atandard noraal cuaulatlva diatrlbutlon. Slaillarly, 

P(l|2) - 1 - ♦(D/2) - ^(-0/2) ^2.19) 

Slnca (2.18) and (2.19) ara conaaquencaa of (2.3) and (2.4), the optlaua 
probability of aiaclaaalflcatlon la glvan by 

OPT - (1/2) [♦(-D/2) ♦ ♦(-D/2)l - ^(-D/2) , (2.20) 


where 9^ “ 92 . 1/2 in (2.2) 

2.2. Tha Saapla Dlacrialnat Function 

Mote that all tha raaulta of aactlon 2.1 ware obtained under tha aaauap- 
tlon Uiat I, and £2 are fixed and known population paraaatera. In aoat 

appllcatlona, £, £ 2 * and £2 ara unknown and auat be aatlaatad froa aaavla 
data, nia claaalcal approach In thla caae la to replace £^t £ 2 * ^ In 

D(X) with thalr »ampU countarparu £ 2 , and 8, where ^ la tha aaaqila 
aatlaata of £. end 8 la the pooled aaapla aatlaata of 1. That la. 
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kj • 


and 


jj «u - SiXSil - k>' * <Sl2 - 22><2 i2 -k>\ 

+ Hj - 2 


21 ) 


where ■ ith random observation vector for population J* n^ s site of 

random sample from populac'') , i ■ 1, 2, ...» n^» j ■ 1»2; and S are 
unbiased estimates for ^ and £» respectively. Making these substitutions in 
(2.10) » one obtains the sample analogue of D(^ as 

Dg(X) • IX - (l/2)(Ii + l2)l’s“^€i - I 2 ). (2.22) 


The rules of classification for a future observation X are if D_(X ) > 0» 

assign ^ to otherwise, assign it to This assumes that <I2 " *l2* 

Recall that the distribution of D(X) is univariate normal. The 

unconditional distribution of D (X) is not so easily handled. In fact the 

8 

unconditional distribution of D (X) is virtually Intractable because S, X, and 
2^ (j * 1,2) are all random variables. Hmrever, one can determine the distri- 
bution of D (X), provided ^ (J « 1,2) and S are considered fixed values. 

When these values are fixed, D (X) has a conditional univariate normal 

S 

distribution and the conditional means and variance of D (^) can be determined. 
That is, 


E(Dg(X)lX^, X 2 , S, X e ir^) « (U^ - (1/2) (X^ + X2>)'S"^(X^ - X 2 >* 
E(D^(X)| X^, X2, S, X e IT ) - (U^ - a/ 2 )(X^ +l2)>'S"^(Xi " 


(2.23) 


and 


Var(Dg(X)|x^, X 2 , S) - (X^ - l2>’x“^ "^(X^ - X 2 ) • 


4 K 


Since D^(X) in univariate normal when given that Xj* ^ are 
fixed* the probability of miaclaaaification baaed on the fixed valuea is 
computed by 

PHC - (1/2)(P,(1|2) + P.(2|l)), (2.24) 
8 8 


where 


P^(l|2) - Pr(Y > yj) . Pg(2|l) - Pr(Y < y^), (2.25) 


and _ . _ , _ 

-(U. - (1/2) (X, + X,))*S‘^(X, - X,) 
yj - -^3 i J 2 ^ . 1,2) . 

(dl - X2 )*s”^ES“^(Xj - Xj))^^^ 


(2.26) 


and T has the standard normal distribution. The calculations leading to (2.26) 
are given in appendix A. 

The reader should note that (2.24) is not of much use in computing the 
pm in a practical situation because the y^ In (2.26) cannot be evaluated 
unless exact values of and £ are known. However* (2.24) can be evaluated 
in sampling experiments where random observations are generated from known 
values of £ and 1^ . This approach will be used to compute the PMC in this 
study. 

Lachenbruch (1975) and Hills (1966) called the PMC computed by (2.4) 

the actual error rate of D (X). Hills also showed that 
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E[9(-D 72)1 < (l/2)[P(l|2) +P(2|l)] < (1/2)[P l|2) +P„(2ll)l* (2.27) 

8 8 8 

where D ■ (X. - X.)'S ^(X, X ) and E[9(-D /2)] is the expected value of 
S “Z X s s 

the estimate of 4(-D/2). 

An objective of the present research is to show that D (]p can be 
modified so that the right member of the inequality in (2.27) is closer to 
the middle member. 
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3 APPLICATION OF PRINCIPAL COMPONENTS 


3.1. Population Principal Components 

The principal components technique originated with Karl Pearson (1901) 
as a means of fitting planes by orthogonal least squares and tias further 
developed by Hotelling (1933) for the purpose of analyzing correlation 
structures in a multivariate system. However, principal components theory 
can be studied by putting the usual developments of eigenvalues and eigen- 
vectors of positive semidefinite matrices in statistical terms. This 
treatment is given below. 

Let X be a p-component random vector with mean 0 and covariance matrix 
It where £ is a real positive semidefinite matrix. Let ^ <1*2 L * • • 1 'i’p L ^ 
be the eigenvalues of £. It is well knovm from matrix theory that there 
exists an orthogonal pxp matrix Z such that 


ZZ' - Z’<) or E - Z'vZ • 


(3.1) 


where 4* * Is s diagonal matrix of eigenvalues of E and Z'Z “ I. 

Note that for purposes of this study, a pxp diagonal matrix with elements d^^ 
on the diagonal shall be denoted by column of Z', or 

equivalently the ith row of Z, is the eigenvector that corresponds to the ith 
eigenvalue 

Let V be a p-component vector such that 


V - ac - 



(3.2) 
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Where is the 1th tw of Z. That is, V is an orthogonal transfomation of 
X. The elements ^ 2 * *** vector V are called the principal 

conq>onents of X. 

From (3.1) and (3.2), It can be seen that the variance-covariance matrix 
of the elements of the vector V Is denoted by 

Var(y) - Var(ZX) - ZEZ* - 4* . (3.3) 

Hence, the first population principal component is ■ Z*X with variance 
and the ith principal component is ■ g^X. 

3.2. Sample Principal Components 

Assume now that the p-component random vector X has a multivariate 
normal distribution with mean U and variance-covariance matrix Z and that a 
random sample of size n is available from the population of this distribu- 
tion. An estimate S of £ may be computed from this sample, where S is at 
least positive semidefinlte. Denote the eigenvalues of S by ^ ^2 ~ *** 

^ ^ 0. Just as in (3.1), there exists an orthogonal matrix T such that 

S - T'AT , (3.4) 

where A * ^ diagonal matrix and T'T > I. The sample principal 

conq>onents vector is defined by m « IX for a vector of observations, X. 13ie 

I 

1th sample principal component is m^ • where t| is the 1th row of the 
matrix T and m^ is the 1th linear compound of the p components of X* 

From a statistical point of view, the basic idea of principal components 
analysis Is to describe the variation of an array of n sample points in a 
p-dimensional space by as few linear compounds of the p-space variables as 
possible. For example, the sample variance of the ith principal component of 
S is ■ X^, where A^ is the ith largest eigenvalue of S. If s 
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P P“8 

eigenvalues of S are sero, then trace S ■ ^ s.. I hence» the study 

i-1 i-l 

of p variables can be reduced to a study of the first (p-s) sample principal 
components because all the variation in the data is accounted for by the 
first (p-s) saiq>le principal components. 

For a clear picture of situations where S nay have s zero eigenvalues 
as opposed to having s eigenvalues that are near zero, consider the following 
situations. Suppose first that n < p. Then the rank of S is known to be 
less than p (i.e., at least (p-n) eigenvalues are zero) because n(<p) points 
cannot possibly span a p-space. Alternatively, if n > p and there are s 
eigenvalues of S that are near zero but not exactly zero. Multicollinearity 
exists whenever one or more of the eigenvalues are near zero. Much has 
been written about the application of principal components analysis in this 
situation; see, for example, Morrison (1976), Rao (1964), or Gnanadeslkan 
(1977) . 

Until recently, the application of principal components analysis has 
been restricted to the analysis and dimension reduction for a multiple 
variable system. Some of the more recent applications of the principal 
component technique are provided in section 3.3. 

3.3. Principal Components Regression Anaysis 

Consider the standard multiple linear regression model 

Y » M + £ , (3.5) 

where 

Y is an (nxl) vector of observations on the response variable. 

X is an (nxp) matrix of n observations on p independent variables, 

8 is a (pxl) vector of unknown parameters, 
and 

e is an (nxl) vector of unobservable random-error variables. 
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2 

such that E(e) • 0 and E(c e,') • o I, where I is en (nxn) identity sntriXt 

X 2 

0 and (n 1) vector of seros, and o is a nonnegative scalar. Frequently, the 
elements of Y and X are standardised; however, this restriction is not 
necessary for the present discussion. 

The usual least squares estimator of £ is given by 

I - (X*X)"^X*Y , (3.6) 

with E(E) ■ ^ and Var(|) ■ (X'X)~^o^. The properties of this estimator are 
well known so the present review need not be extensive. For a more detailed 
treatment, the reader may consult, for example, Graybill (1976). 

One of the well known properties of the estimator ^ is that it is 
unbiased and the variance of its components is minimum within the class of 
all unbiased estimators of J|^. However, difficulties arise with this 
estimator when X'X is near-singular or, equivalently, when strong multi- 
colllnearities exist in the sample data. One of the primary difficulties is 
that multlcollinearity causes the components of ^ to have large variances. 

To correct for the difficulties that arise when X'X is near-singular. 
Massy (1965), Marquardt (1970), and Hawkins (1973), among others, have 
recommended a technique called principal components regression. Another - 
approach for overcoming problons associated with data multlcollinearity is 
ridge regression, proposed by Hoerl and Kennard (1970). Hoerl and Kennard's 
ridge estimator is defined by 

^ - (X'X + K)’^ x4 

idiere K is a general diagonal matrix and the principal components estimator 
is defined by 
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where 


• «•»- X'T 


(X'X) - T'*' I, 

o 


A 


S 


1 

X, 


1, 

I. 


8 o 


where X^« X2» •••* X^ are the eigenvalues of X'X, g ■ p-a, the last a 
analleat eigenvalues have been equated to zero and T la an orthogonal matrix 
of X'X. The matrix (X'X)** la generally referred to aa the generalized 

A A 

Inverse of T*A^T. Although and are biased estimators of It can 
be shown that they are more stable (their components have a smaller 

A 

variance than the corresponding components of than the least squares 


estimator 


3.4 Relation of Ridge Estimators to 
"rinclpal Components Estimators 

In discussing the relation of ridge to principal components estimators » 
it is convenient to employ a general form of a larger class of estimators 
presented by Gunst and Mason (1977). Their general form is 


A M 

i* ■ J . c t . 
1-1 ^ 


(3.16) 


where a^ depends on the particular estimator, c^ ■ ^]^X'Y is the same for all 
estimators, and ^ is the ith eigenvector of X'X. Gunst and Mason showed 
that least aquares, principal components, ridge, and two other biased 
regression estimators may be obtained from this general form by assigning 
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the appropriate values to a^. Socking (1976) gave an alternate varaion of 
(3.16). The general fora in (3.16) is equal to ^ if a^ ■ 1/1^» to Hoerl and 
Kennard's ridge eatinator* ^ if a^ • 1/(1^ *** k)» »oid to idien a^ ■ 1/X^ if 
i i p-a and a^ ■ 0 if p-a < i £ p. 

In suDnary, principal coaponenta techniques are a fundaaental process 
through which biased estiaators for the general linear aodel have been 
developed. In every biased estiaator of regression paraaeters, the eigen- 
values and eigenvectors of X*X play an essential role in their development. 
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4 PRINCIPAL COMPONENTS THEORY IN REUTION 
TO DISCRIMINANT ANALYSIS 

4.1 Analogy of Discrinlnant 
Analysis with Regression 

A natural parallel exists between the two-group linear dsicrinlnant 
analysis problem, as developed in section 2, and multiple linear regression. 
Kshlrsagar (1972), Lachenbruch (1975), and, of course, Fisher (1936) 
showed that by using dunmy independent variables, the regression model can 
be used to derive the sample linear discriminant function (2.22). In (2.22) 
recall that D^(X) was defined by 

Dg(X) - (X - %(Ii + l 2 ))'S"^(Xj - Xj) or, equivalently, 

»g(X) - ■ ^ 2 ^ " ^^1 “ V • 

The first term on the right side of (4.1) is a linear combination of the 
components of X, where S ^ (X - X 2 ) is the sample estimate of the population 
coefficients Z ^(U^ - term is a constant for fixed values 

of Xj^, X^* Recall that one purpose for altering a near-singular 

matrix X*X in £ • (X*X) ^'Y was to reduce the variance in the coiq>onents 
of Because of the natural connection between linear discriminant 
analysis and linear regression, it seems natural that store stable estimates 
of the discriminant coefficients E ” ~2^ **ould produce a discriminant 
function whose PMC is lower than the PMC of D^(X). In fact, DiPillo (1976) 
and Smidt and McDonald (1976) showed by Monte Carlo experiments that the 
application of the ridge technique to discriminant analysis improved the 
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PHC of the sanple discrimln«it function. They propoeed an alteration on 
the coBBonly ueed eanple function; the general fom of their biaaed diaerial** 
nant function ia 

- (X - + Xj)) • (S + kl)"^ di - |2> • <*•*> 

Where k ie a nonnegative scalar and S, ^ (J ■ 1»2) are as defined in 
section 2. DiPlllo selected k ■ 1 while Smidt and McDonald detemined the 


constant k by 


k ■ eXp , 


(A.3) 


idiere X^ is the snallest eigenvalue of S and c ■ (pf2)/(H-p-2), where p is 
the nuad>er of variables in X and N is the total sample sice used to estimate 
S. Smidt and McDonald called h|^(X) the ridge discriminant function. In 
this section, new biased discriminant functions will be Introduced. 


4.2. The Effect of the Position of U. - U. 
on the Variances of the 
Discriminant Coefficients 

The previous discussion stated that the two»population discriminant 
function can be derived through multiple linear regression techniques. 

Recall from (3.5) that is the vector of regression parameters to be 
estimated, and the wd>iased estimator is given in (3.6). For the linear 
discriminant problem, the population parameter - £ 2 ) of the first 

term in the last manber of equality (2.10) is the vector of population 
discriminant coefficients. The sasqtle estimate of these coefficients is 
obtained by replacing I and (J ■ 1.2) by their sample counterparts. 

Just as small eigenvalues in X'X Inflate the variances of the components 
of 6» the variances of the components of S**^(X 2 - nay be large for 
similar reasons. Das Gtq>ta (1966) showed that the variance-covariance 
matrix of - Xj) is 
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(4.4) 


- ^)) - 


%4i«r« 


and 




♦ *2 • 

i (n^ + Hj - p - 2)cn^ ♦ nj - 3)(iij ♦ - p - 5) 


(«. ♦ n, - 3)(n, + nO 

t- ■ * 

2 *l"2 


, »1 * ■‘2 - » - t 

3 ♦ «2 “ P " 3 


I is ths (pKp) idsneity mscrlx . 


vector of Z**^, and 1/p^ Is the 1th eigenvalue of Z~^. Then the expression 


Let d •« U - U 2 t 0^ ■ angle between ^ and s^. where 7^ Is the ith eigen- 
vector of Z~^, and 1/p^ Is the It 
given by (4.4) nay be written as 

I 
i 

VarlS* 


-1 - _ f P ((d'd)‘*cos 2 ,)^ 

S ‘(Si - Si>l - I ♦ ‘ji 

r E E 

■U — iT ^ J )'- i ""''!.!; • 


(4.5) 


If at least one eigenvalue In (4.5) Is small, then at least one component 
of S~^(X 2 “ X 2 ) has a large variance. 

The expression In (4.5) allows an assessment of the effect of the 
position of i on the variance of the components of S ^(X^ - X 2 ). If 
Is small and d Is orthogonal to Z^, then the variability In certain 
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ccnqwnMitB !• not aa larga as it would ba if wara aaall In eoablaatlon 
with being near parallel to d. Therefore* the poaitlon of d in the 
p-apaee ahould have a definite effect on the diacrlainant function when 
■ultlcollinearlty exiata. 

4.3. Principal Couponenta Dlacrlnlnant 
Function and Its Relation to 
the Ridge Diacrlainant Function 

A new definition of the principal conponenta diacrlainant function will 
now be given. Let 8 be the usual pooled aaaple eetlaator of C aa defined 
In (2.20). It will be useful in the sequel to think of or inverses of 
aatrlces derived froa 8 by adding at least one positive constant to the 
diagonal of 8 or tj the eigenvalues of 8 as bias«l estlaatora of Let 

the diagonal aatrlx A be the aatrlx of eigenvalues of 8* and let T be the 
aatrix of eigenvectors* so that 8 ■ T'AT. As In the case of principal 
conponenta regression* suppose that s of the analleat eigenvalues In A are 
deleted to give 


A - 

8 


idtere g ■ p-s. Then* 8 la defined by 

O 

8 • T* T and 8” ■ T* *T * (4.7) 

8 8 8 8 


(4.6) 
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Where A* xe the generellsed imrwee of A^. 

The prlnclpel coeponente eaaple diecrialnent function It defined by 

»g(S> • IS - %<*! ♦ - ^)- (♦•») 

Obeerve thet in the ridge diecrininent function given in (4.2), the 
conpounding netrix ney elveye be mpreeeed by 


(8 ♦ kl)"^ - (T*AT + kl)"*^ 


• (T*AT + kT’I)“^ - T*(A + kI)"H. 


(4.9) 


Aleo notice thet for eny poeitive conetent k, there exiete e set of constMite 
so thet 




T • T'(A ♦ kI)“^T 


(4.10) 


idiere 



(4.11) 


Pron (4.10), it is clesr thet c*/A^ ■ 1/(1^ * k); end this inpllee ■ X^/ 
(A^ <f k) < 1 whenever k > 0. Thet is, the reeults obtsined by sddlos sone 
constent k to eech diegonsl entry of 8 nsy elso be obtsined by nultiplying 
the ith eigenvelue in T'A**^i by the velue c| ■ X^/(X^ -f k) <1. This 
suggests thet e nore generel hissed estinetor of wqr be define^' by 
nultiplying the ith eigenvelue in T'A*^ by sone c^ where < 1 end c^ is 
not necesserlly X^/(X^ ^ k). A good cendldete for is ■ 1^/(X^ * k^). 
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wh«r« ^ 0 and k^ nay or may not ba equal to k^ for 1 d j* It ahould be 
pointed out that ehooelng *<ltt^valent to defining an 

estiaator of by 

T'(A ♦ K)“^ . (♦.12) 

where K ie a diagonal aatrlx. Mote that a general diaponal aetrix, 

(A. 12) la not the aaae ea (84K)~^« The reader aay refer to appendix B to 
aee why theae two aatricea are different. The perfotaance of diacrlalnant 
functione baaed on (A. 12) will be inweatlgated. Their apeclfic definltlone 
will be given in aectlon 5. 

A. A. The General Biased Dlacrialnant Function 
Let 

• IX - %<|j ♦ l2)l’T'ICi/^^lJ.lT(lj _ Xj) , (A. 13) 

wttere T «td are defined above and is any nonnegative constant less 
than or equal to one, and c denotes a generic biased discriminant function. 
If c^ ■ I for i ■ 1, 2, ..., g and c^ ■ 0 for g < i < P» where g is defined 
in (A. 6), then (A. 13) becoaes l>g(X)* 2f c^ ■ X^/(X^ 4> k), where k is given 
in (A. 2), (A. 13) reduces to 1^(X). Finally, if c^ ■ 1 for all i ■ 1, 2, 
•••* P* 1^2 (X) ia the standard saq>le linear discriminant function given in 
section 2. 

Under the condition that Xj^, ^ fixed, is normally 

distributed. Calculations similar to those in (2.23) give 

and, for aay X, 
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J [ <X -Xj) *T» [c^/XjlJ.^TET* ICj/Xjl 

The justification for biasing the S matrix in the sample linear 
discriminant function is that under certain conditions this biased discriminant 
function has a lower PMC than the standard saiq>le discriminant function. The 
lower PMC is achieved through a reduction in the conditional variance of the 
sample discriminant function. That is* it will be shown that there exists 
a set so that 

Var[D^(X) (Xj. Xj. S] < VartDg(plX^. S] . (4.19) 

It is clear that if c^ ■ c < 1 for (i * 1, 2, ..., p), then 
T’ (c/A^l J^^T - cS”^ and 

Var[D^(X)lXj.X 2 .Sl - c^ Var(D^(p lIl.Ij'S] < Var(D^(X) |X^,X 2 ,Sl. 

However, this choice for the set suitable for reducing the 

PMC because (4.18) is invariant with respect to multiplying S ^ by a 
constant. 

The following will show that there exists a set so that (4.19) 

is true. Recall that I ■ Z'^Z, where Z'Z * I. If T is any orthogonal 
matrix so chat T'T ■ I, then TZ* > P' is also orthogonal and F'P ■ I. Let 
be the vector representation of the jth column of P* and P.. be the entry 


In the ith row of and the Jth eolunm of P*. Then clearly « 
where ^ Is the 1th row of T and ^ la the Jth column of Z', and is 
also the cosine of the angle between _t^ and Let " 

(n^, mj, mp). The matrix TET* in (4.16) is now represented by P’6P, 
where P* - TZ* and T is specifically the matrix of eigenvectors of S. 
Therefore, from (4.16), 


VarlD^QC) 


n’tCi/XilJ.iP*pP(Ci/XilJ.^m 

s’Ccj/XjiJ.i 


J-1 ^ 


(Cl/*l)Plj 

(c2/X2)P2j 


l'VV^jj 


j/X2)P2j , ' 

...,(Cj/Xp)ppj J 




(4.20) 


where, if c^ ■ 1 (i ■ 1, 2, ..., p), (4.20) becomes 

I »«iv» ®i- *' • 

To complete the existence proof, it is sufficient to show that a set 
found so that 
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/ 


(4.21) 



Thus, If J Is fixed at J*, It Is sufficient to consider the corresponding J* 
term on opposite sides of the inequality in (4.22). That is, it is 
sufficient to choose such that 







(4.23) 


for each J'. If each pair of jth terms on opposite sides of the inequality 
in (4.22) is related in this manner, a system of linear inequalities of the 
form 


! 

i»i 


(Cjm^/A^)P^j 




(j - 1, 2, 


p) (4.24) 


must be satisfied by a vector £* • (c^, C 2 > .*•, c^) • It is known that 
equality holds In (4.24) if £* * 1, 1, ..., 1); and the given inequality 
holds if £* “ a(l, 1, ..., 1), where 0 < a < 1. As pointed out just after 
(4.19), £* > a(l, 1, ..., 1) is not a suitable choice for reducing the 
conditional PMC. Since equality holds in (4.23) if £* > (1, 1, ..., 1), 
elements of £ or of are selected by the following process. Let c^j 

be the cariable c^ in the jth equation that is obtained by assuming equality 
in (4.24), and let w^^j be the value of c^j where the Jth equation in (4.26) 
intersects axis c^. Mow choose 

c. - min (l,lw..j). (4.25) 

J-l,2,...,p 
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Pt then any 


If c. • min (1, |w.. 1 ) ■ 1 for all 1 ■ 1, 2, ...» 

J“l»2» • • • »P 


cond>ination of c^'a where < 1 for 1 ■ 1, 2» ...» p« aatlaflea (4.24). 


The selection of as outlined above Insures that (4.24) is true for each 


J > 1» 2» ...» p which implies that (4.22) is true for the selected set 




DiPlllo (1976) showed that 

Var[Dj^(p 11 ^» l 2 » S] < Var[Dg(X) Hj» I 2 . S] 


(4.26) 


for any k 0. However» it will be shown here that this result holds with 
less generality than originally claimed. His claim is now investigated by 
using (4.20) » where c^ is replaced by X^/(X^ -f k). 


Thus let 
h(k) 


St 

Var(D^(l!lj» I 2 , SI - 


Then 

h*(k) 




27) 


So» 


- -2 f ♦.m'A"^P,P;A"“m 

j«l J J J 

- -2m *A"^ f ♦.P^PlA'^m 

J-1 

- -2m’A“^IT'A“^m » 
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vhere A - ■ A~^. A**^. Since h is continuous and differ- 

entiable on the interval (-X , where X > 0 is the smallest eigenvalue 

P P 

of S, h is differentiable at k ■ 0. If (4.26) is true for any k > 0, then 

h'(0) < 0. That ist h is at least decreasing on some open interval contain- 
-1 -2 

ing zero. But A TIT'A is not necessarily positive definite. Hence, there- 
fore, DiPillo's statement should be slightly revised to read, "There exists 
a k^ > 0 such that (4.26) holds for all k > k^^." Perhaps h'(0) is positive 
only in extreme cases, such as for small samples; nevertheless, DlPillo*s 
claim is not generally true. In order to be certain that (4.26) is true for 

any k > 0, something must be known about £. For example, if £ ■ I, then 

(4.26) is true for any k > 0, which also means that (4.22) would be true for 

any combination of c^'s, where < 1 (1 ■ 1, 2, ..., p). 

Further inspection of I>|^(x) along with c*, where c^ is defined in (4.11) 

reveals that lim c* * 11m X /(X. -f k) • 0 for each i. This 

fc+4« ^ ^ ^ 

implies that there exists some positive such that for any e > 0, 

|cP/X. - c*/X I < e whenever k > N, . Hence, when k is large and/or if the 
X 1 p p X 

eigenvalues nearly equal, the eigenvalues of (S -t- kl) ^ ara 

nearly equal. Increasing k beyond a point where the eigenvalues are almost 
equal is the near equivalent of multiplying the numerator and denominator of 
(4.18) by the same value. Therefore, when k is selected so that 
l/(Xj^ k) &1/(X^ + k), no additional improvement in the conditional PMC 
of D|^(X) is expected from a larger k. This explains what Smidt and McDonald 
(1976) observed as an "interesting phenomenon" when they evaluated the PMC 
for 0|^(X) based on observations generated from a distribution where £ • I. 

In the present study, several such biased discriminant functions are 
evaluated and compared as outlined in section 5. For a Justification of the 
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additional biasing methods presented here, further attention Is given the 
variance In (4.20). Note that (4.20) may be expressed as 


p p '^m. 2 

VarlD (X)|X . f , S) - J J p . (4.28) 

* ^ J -1 1-1 *1 


The expanded form of (4.28), for p"3 for example. Is 

V ‘i ‘3 ; 


, ■^V32V 

‘i ‘3 y 

/ ^"l^l3 '^'^'23 *^*3^3^ ' 

V »i ‘2 ‘3 7 


W.2») 


If S were. In fact, E or at least if T ■ Z, then ■ 1 and ■ 0 where 
1 J. It Is generally expected that the terms in (4.29) that Involve the 
factor where 1 > j, will contribute more to Var[D^(X) Ix j^, Xj, S] 

than those terms that have the factor i^^/X, where 1 > J, because 
v^/Xj^ > *^/Xj whenever 1 > j. Recall from (2.26) of section 2 that the 
primary purpose for biasing 0^(X) is to Increase the absolute value of y^. 
Hills (1966) showed that jy^ | Is smaller than its population counterpart. 
Therefore, the present study proposes to bias D (X) so that biasing will 
have its greatest effect on the i^mj^Pj^j/X^ (1 “ J) terms of (4.28). The 
rationale is to add a different positive value to each eigenvalue X^ so 
that *^/X^ 1. In practice, the value of is unknown; therefore, Xj 

will be substituted for The general form of k^ will be 

ki - f^(^ - X^ + f^) . (4.30) 
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Note that If ■ 1 and f 2 ■ 0, i^/(X^ + k^) < 1 for all 1 and J, alnca 
^ X 2 _ ^ Ip* Slaulatlon axpertnanta will allow that whan la 

salectad In thla aannart for certain casea the aagnltuda of tha reduction 
In the denominator of (2.26) la greater than the correapondlng reduction 
In the numerator. Specific valuea for f^ and f 2 are given In aectlon 5. 
DlPlllo (1976) and Smidt and McDonald (1976) reatrlcted their blaalng 


alteration of D (X) to adding aome conatant k to the elgenvaluea of S. An 
a ^ 

alternative approach la to blaa the elgenvaluea of R, where R la the aample 

correlation matrix. To aee thla, let the matrix E ■ 

la the ith diagonal entry oa S; then ■ R. A blaaed eatlmate of 




F* 


1 




P 



1-1 


(4.31) 


VIhere ^ >2 — * * * — elgenvaluea of R and F la the matrix of 

elgenvectora of R, and k^ la of the form given by (4.30). When In 
(2.22) la replaced by , another blaaed linear diacrlmlnant function la 
defined. Several blaaed functlona defined In tema of S^ are evaluated In 
thla atudy. 


4.S. The Effect of Blaalng In Relation 
to the Position of - U 2 

In this aectlon, the behavior of y* In (4.18) la Investigated aa 
k^ -► For convenience, asaume that Xj - (j • 1,2) ao that y* ■ -y*. 
Since kj^ -^ +* for each 1-1, 2, ..., pla equlvtiJont to letting k^ - k ■♦ +» 
this Investigation deals only with * k for all 1. Under the above 
assumption. It la sufficient to examine only 11m yt. Note that 
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lln yi 

k-*4* 


lin 

k*H» 


"A — tt 


L-t-J t-l 


' L » J i-i L * J i.i ' (4, 
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l^d’d 


(d'£d) 


where d ■ U - U 2 » and 


D d'd cos 2 P 2 

d’l *d - I T 1 • D^. d’Ed - I ♦.d'd cos^O . 

1-1 '•'i 1-1 ^ ‘ 


where 9^ is Che angle between d and and Is the ith eigenvector of 
Z. Also, 


>sd'd 

5 

(d’Ed)^ 



and 


i' n “»** 

JjD « »»(d'2:“^d^)’‘ - 'jVd'a)’‘| i (l/*.)cos^»jj 


where is the optinun value for Y as given in Section 2, where 
Y - (0 - E[U])/D and U is given in (2.18). Consider two extreme cases: 
Case I . d is parallel to ^ for any 1 - 1, 2, ..., p. Then 
0^-0 and 8j ■ n/2 for i J. Hence, 


HCd'd)** 


?,D 




lim y* • 




0 

c 

0 


which Is the optinum value of Y. Thus» If d is parallel to any the 
optimum PMC nay be achieved by assigning a very large value to k. 

Case II. ■ e for all i,j ■ 1, 2» .... p. For this case. 


and 



From the definition of 6. cos6 ■ l/<^ It will now be shown that 

lira yj <. iD , (4 

K • •• 


when for all i ■ 1. 2, .... p. The above substitution for cos 6 

gives 

lim y» 1 iD , 


iff 


iff 


1 / P l\^ 


p ^ II ♦./♦. • P + I 


2 

The extreme rl^t mead>er of (A. 34) eontelns |(p ” p) terme of the form 
(t|ii/pj ’** reciprocal of Any positive 

number plus Its reciprocal Is greater than or equal to 2. Hence, (4,34) Is 
verified by p* ■ p + 2IJ(p* - p)l i P + ^ therefore, the 

relation In (4.33) la true. Note that If all p^*s are equal, the equality 
part of (4.33) holds. Thus, If 6^ ■ 6 and or If any 6^ ■ 0, one 

can expect to obtain a "near optlum" claealflcatlon model by biasing thu 
sample discriminant function with a large k. However, If ^ ■ 6 for 1*1, 
2, ..., p and If there Is a mixture of large and small p^'s biasing with a 
large k may produce a function that Is far from optimum. 
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S SUfOLATlOItS. DISCUSSION AND CONCLUSION 


5.1. Introduction 

The objective of the coavuter elaulatlon Is to conpare and evaluate 
the effectlveneas of different blaalng procedurea on the conditional PNC 
when £ la near-alngular. The alatulatlon la dealgned to control for the 
following factors: 

1. The severity of the isultlcolllnearlty In £. 

2. The orientation of U^^ - U 2 to the eigenvectors defining the 
anil t icoll Inear 1 ty . 

3. The Nahalanobis distance between and « 2 * 

4. The saaiple size. 

The simulations were conducted on a UNIVAC 1108 computer at the George C. 
Harshall Space Flight Center, Huntsville, Alabama, using a program written 
by the author which Incorporated subroutines from MATH PACK and STAT PACK. 

5.2. Construction 

The comaon variance-covariance matrix £ Is constructed so that varying 
degrees of singularity, or multicolllnearlty, are represented. DlPlllo 
(1976) defined his £ by 


A 

1 

A’a 

a'k 

£*Aa + 

« 
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Uhare a* ■ (1/p-l, 1/p-l) is a 1 x (p-1) vactor and nhara la aoM 

positlva scalar and A la a (p»l) x (p-I) sysmtric aatrix. Tha poaltiva 
2 

acalar o la designed as a singularity control. Xt Is lapllclt that, 

when £ Is defined by (S.l), all the variables are Involved In the nultl- 

colllnearlty. To see this, let X be a randon vector so that 

Var(x) ■ A(p.i)x(p.i) ^ positive definite. Suppose that a pth 

variable Is defined by X ■ ^ e.X. ■ e'X such that X*' ■ (X'|X ] Is a new 

P 1.1 1 i -- “ P 

Ixp randon vector where e Is an arbitrary vector. Without any loss of 
generality. It Is assuned that E(X) ■ 0. Mow, 

CovlX^, Xpl - ElX^Xp) - Ele^xjl . 


where Is the 1th colunm of A and e Is the vector of coefficients 

defining X . Also, Var(X^) ■ E[e'X]^ ■ e'Ae. Hence, 

P P ~ ~ 

Var(X*) - - 

2 

Here, It Is clear that 0 ■ 0, and thus perfect multlcolllnearlty exists and 

involves all the variables when e^ • l/(p-l) • a^ for 1 ■ 1, 2, ..., p-1, 

2 

where a^ is the 1th conponent in vector a of (S.l). If o Is Increased, the 
degree of multlcolllnearlty Is decreased. 

Following the approach of DlPlllo, let 



Uj - ("j ll'llj ) . (5.2) 

idiere a Is as defined in (5.1) and Is the (p-1) x 1 jth (J > 1, 2) 
population oiean vector corresponding to the consion variance natrlx A. 
DlPlllo stated that 

(ni - n2)’A"^n^ - n^) - (U^ - <5*5) 
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whcr* At It n^» and ^ (J lt2) ar« aa dafinad abova. Thla aqualtty ia 

raastablishad hara using any vector a In place of a. That la» let 

2 

ii * ~ -2 mmsaro (p>l) x 1 vector md o > 0. Thant 



h 

a’h 
• * 

• h'A'^h . 


Ch’ 1 I’M 


• a 

a’^ ♦ a a’/o* 

-a7o* i 1/0* 


(5. A) 


Hence, the distance between the two populations Is not affected by either 
2 

o > 0 or the form of the vector e. 

The relative position of Uj - Uj • (n^ I a'njl - (n^ | a’lijl ■ 

(n* - 112 I a*(iij^ “ 1 ) 2)1 to the pth eigenvector will now be examined where 

(J ■ 1,2) Is as defined In (5.2). If perfect multlcolllnearlty exists 
2 

In £, l.e.t If o *0 and I has only one zero eigenvalue, then the pth 

eigenvector of I Is [<-c* | 1) (or some scalar multiple of this vector) 

because when perfect multlcolllnearlty exists. It Is defined by the eigen- 

2 

vector corresponding to the smallest eigenvalue, which is zero Is o * 0. 
As o* gets larger, the pth eigenvector deviates from (-e* | 1]. Now, 

, h 

(-e* 11 — - -e’h -f e'h - 0 , 

" ‘ e'h -- -- 

which Issues that 
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- 


h 

• *h 


•a dafinad in (S.A)» la orthogonal to tha alganvactor dafinlng tha anilti- 

2 

collinaarlty* Thia naana that iihen o ■ 0, - U 2 1> confinad to tha 

apaca of tha flrat (p*l) alganvactora; and hance tha pth alganvactor 
eontrlbutaa nothing to tha dlatanca batvaan tha naana. To aea thla, ona 
naada only to Inapact - £ 2 ) by parformlng a principal 

coBponanta tranaforaatlon. That la» " f vharc (U^ ~ £ 2 ) 

and la tha 1th alganvactor of £. If d^ ■ 0, than £^ - £2 la orthogonal 
to Z^. 

Tha conatructlon for tha natrlx I as uaed In thla atudy will now ba 

defined along with tha varloua orlantatlona for tha vector £^ “ £ 2 * Let 

A be a (p-2) x (p-2) aynnatric poaitlva definite mtrlx. Let a^^ be a 

2 2 

(p~2) X 1 vector* £2 a (p*l) x 1 vector* and positive acalara. 

Let ^ 

A ! A*a 


” JJ* -I 


and 


I • 


i 




I “2 


(5.5) 


The coltann vector Z^ la tha ich eigenvector of Z and la a conatant to be 
defined below. Let 

£J ■ 0 and £J ■ f a^^ . 


i-l 
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For this study, where b^ controls the angle between Z 

and U| - Note that if ■ 1 (i • 1, 2, ...» p), then 
D - ((j^ - - U|)l* ■ 1; and 

i 

I(of - ]!$)'(»}- 2J)1^ 

where 6^ is irie angle between ^ and U* - U*. When b ^ ■ 1 (1 ■ 1, 2, ...» 

p), all principal components contribute equally to D. Also note that the 
Mahalanobls distance can be controlled by defining. 

2l - ]?■> - 

where D is the distance between and ir^. If b^ 1 for all 1, then 

(U* -U*)'Z"^(U* -U*) - (1/p) f b, . (5.6) 

1 z 1 z i-1 ^ 

Therefore, b. will be selected so that f b. > p; and hence, (5.6) has a 

i-1 ^ 

value of 1 for any set of b^'s. This sum of the b^'s is easily controlled 
by using the properties of arithmetic sequences and series. 

The b^'s are defined here in the following three different ways: 

*»i “ 1 - 1. 2, .... p , 

(2) b^ - 1 i - 1, 2 , 

(3) bj • i - 1. 2 . 

The above definitions of the b^'s are convenient for computer coding. 


! 


“ cos 6^ 
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Let N ■> + n 2 » where is the sise of the sample from ir^. Recall 

that any general biased estimator of Z was denoted by 

where - X^/(X^ + k^) and ^ 0. Now, each procedure for computing k^ 

will correspond to a particular S The k. used in the simulation study 

c 1 

here and the corresponding syndbol for are listed as (a) throu^ (f) 
below and (g) through (i) later. 


(a) k^ - ^ 


0 if i - 1 

X^ if X. > /TT and i > 1 
p i 1 

/x7 - X. + X if X. < /x7 

1 1 p i — 1 


ro if i - 1 


(b) 


K >1 > ^7 »«i 1 > 1 

- Xj + Xjj) if Aj 47 


Corresponding Symbol 
for S~^ 

c 




fO if i - 1 


(O •{ 


if X > /3T and i > 1 
N-p-i p i 1 


.« i - Xa 

I - i AT 



4.1 










Ic £of ^ * Xf 2f •••y p 

vhera 


(d) - 



and is as defined In 
(c) above 


(e) kj^ ■ 1 for i ■ 1, 2, ..., p 

(f) k^ for i - X, 2, ..., p . 


Correapondlng SymboX 
for S"^ 

S 




The choice of and the corresponding identity matrix in (f) are 

motivated by the behavior of the limit of y^ at k ■ where this limit is 

evaluated in (4.32). Although It Is clear that If k^ -> *H» for 1 ~ 1» 2, 

. . . , p« the corresponding matrix in (f ) converges to the zero matrix; 

c 

but, the ratio in (4.32) converges to the expression given there. Since 
the function Dj,(X) - (X - KXj^ + 12^1 ' ^ 2 ^ produces the identical 
ratio given in (4.32) when its expected value is divided by the square 
root of its variance provided X. * U. (j ■ 1,2), D_(X) is taken to be the 
biased discriminant function that corresponds to k^ ■ +« for 1*1, 2, 

. . . , p. 

The following symbols represent the biased estimator when the 
eigenvalues of the sample correlation matrix are biased. For this case, 
recall that S”^ - E ^F'[X/(Yj^ + ^ given by (4.31). 
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(g) k, 


0 if 1 • 1 

and i > I 

^2 ^ ^ Yj < /?7 


[ 0 if 1 • 1 

- Yj + Yp if Y^ < .^ 

[ 0 if Yj > 0.1 
*♦• if Yj^ < 0.1 


Corresponding Symbol 
for 

c 


'i* ■ *c‘ 


*«* - 


s - s"^ 

D c 


The reader should recall that the situation where a particular is ■*«> 
fdille all other k^*s are zero is equivalent to an earlier definition of the 
principal conq>onent discriminant function where the i'th eigenvalue is 
equated to zero. Each biased discriminant function is defined by 

D(j ) (X) • IX - + I2) 1 ’ S'J j (X^ - I2) . 


where J ■ A. P, G, K, 0, F, R, M, D and the unbiased discriminant function 
is denoted by D (X^) . 

For the present simulation study, p ■ 10, ■ (0, 0, l/(p-2), 

l/(p-2), 0, ..., 0), and e^ • (i, i, 0 0), where p, e^^, and are 

2 2 

defined in (5.5). This means that when both and O 2 small, multi- 

collinearities exist between variables 3, 4, and 9 as controlled by e^ 

and variables 1, 2, and 10 which is controlled by £ 2 . In order to achieve 

2 2 

the purposes outlined in section 5.1, the variables n^^ ■ n 2 * n, o^, o^, 
a^ (1 ■ 1, 2, ..., p), and D were assigned the follovd.ng values: 
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001. 10. 0 


al • . 001 . 1.0 



o'* 

“ L 9p J * Li 

rJ ■ LV-'iJ 


n - 10. 25 
D * 0.6. 1.0. 3.0 . 


This gives 72 different simulation design configurations to be evaluated 
on each of the nine different biasing procedures (a) throiigh (i). 

To evaluate the 72 configurations, a computer program was written 


to; 


1. Generage an independent random sample of size n for each Vj 
(J « 1,2) population. 

2. Compute X^, X 2 < and S for the sample. 

3. Compute the values for as defined above. 

4. Compute the conditional PMC for D (X) and for each biased 

fi ** 

discriminant function. 

5. Replicate steps 1>4 30 times. 

6. Calculate the means and variances of the conditional PMC's for 
the 30 replications. 


5.3 Summary of Results 

The complete results of the sampling experiments are given in tables 
8 through 79 in appendix 0. The data contained in each column is described 
below: 

Column 1. Name of the estimator. 
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Coluwn 2 . Average PMC for the 30 replications using where 

J - S, K, G, R, 0, M, A, P, 0, F. 

roiiMim 3 . Variance of the PMC for the 30 replications 
Coluan 4. Average PMC for a biased estimator minus average PMC 
for estimator S evaluated on the 30 replications. 

Column 5 . Nuaiber of tlsws» out of 30, a biased PMC Is lower than 
that of estimator S. 

The actual population values for D along with the associated PMC, denoted 
by OPT, and the orientation of - £2 are given for each table. Note 
that in tables 8-31, ^p/^p “ 0 and 1 < P» In tables 

32-55, dj/pj - dj/pj (l^j); and in tables 56-79, dJ/pj - 0 and dj/ ^ < 

^ 1 + 1 ^* 1+1 ^ ^ Jl ‘* i^'*'l " ^-1 ■ 3 ^ 2 ^ '^’^^^^1 " ^^ 2 ^* 

5. A. Discussion of Results 

In order to compare the performance of the biased procedures to the 

standard unbiased one. It Is necessary to examine the Indicators of 

Improved performance in tables 8-79. The Indicators are colimms 3-4. 

The most striking feature of tables 8-79 Is the dominant Influence of 

the position of vector - £2 on the indicators of Improved performance. 

2 2 

In tables 8-31, £j^ - £2 Is positioned so that 

this position, all biased procedures, except K, showed positive values for 
column 4; and the entry In column 4 for K Is positive when D ^ 1. A com- 
parison of the variances of the estimators in tables 8-31 shows that when 
D < 1.0, the variance of each biased estimator Is greater than the variance 
of the unbiased one; the opposite Is true when D > 1.0, except for biased 
estimators D and K. Indicators In column 5 are generally good for all 
biased estimators except for K, but K was favorable when D > 1. Tables 
32-55 show that the performances of biasing procedures are mixed. Here 
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- ^2 is positioned so that all eigenvectors contribute equally to D 
and the general trend is for all indicators to improve as D gets larger. 
Tables 56-79 show that all biasing procedures performed poorly when 

- U 2 is defined so that d^/4>^ < Although most procedures 

tended to improve on indicators in column 4 and 5 as the value of D 
increased, the general performance of all biasing procedures was poor 
when n ■ 25 and the orientation of - £2 was such that the principal 
components associated with small eigenvalues contributed heavily to D. 

A noticeable exception is K. The amount of improvement in the mean PMC 
for K over the mean PMC for S is considerable when n ■ 10 and D > 1. 

It appears that no firm statements on the effects of eigenvalue size 
or the degree of multicolllnearity can be made, because the effects of 
eigenvalue size seem to depend on the position of the mean vector " £ 2 * 

A comparison of results in tables I through 4 adds support to this claim. 

In tables 1 and 2, ~ —2 * *^-l parallel to and in tables 3 and 

4, 0 - ^2 * *^-p ^ parallel to any then the optimum PMC can be 
achieved by letting K +<». This result was obtained under the assump- 
tion that U, * X. • Tables 1 and 2 show that when U, - U. is parallel to 

-j -2 

the mean PMC of F is close to the optimum PMC and all biased procedures 

2 2 

perform well even though * ^2 * which is the worst multicollln- 

earity case considered in this study. However, in tables 3 and 4, 
performance of F and all other biased procedures is poor, in spite of 
the fact that all configurations are the same as in tables 1 and 2, 
except - U 2 is now parallel to The poor performance of biased 
procedures in tables 3 and 4 is due to the large variances in the com- 
ponents of S~^(X. - X ) as discussed in section 4.2. It is also 

X 
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Table 1 


Comparison of Probabilities of Misclasslficacion for 
Several Discriminant Functions, 30 Replications, 
when D - 1.0 (OPT ■ .3085), U, - U, ■ 

n - 10 ^ 1 

EstliPitor 

Mean 

PMC 

Variance 

Improvement Over 
Estimator S 

Musiber of Times 
PNC is Lower 
than That of 
Estimator S 
(max - 30) 

S 

.4482785 

.0025357 



K 

.4177978 

.0279583 

.0384815 

15 

G 

.3956326 

.0020253 

.0526459 

27 

R 

.3705756 

.0020419 

.0777029 

29 

D 

.3998220 

.0923597 

.0492565 

26 

M 

.3657406 

.0022464 

.0825379 

29 

A 

.3854223 

.0016827 

.0628562 

28 

P 

.3805779 

.0018560 

.0677006 

28 

0 

.4082210 

.0025667 

.0400575 

26 

F 

.3510729 

.0034107 

.0972056 

28 


Table 2 

Comparison of Probabllicics of Mlsclassificatlon for 
Several Discriminant Functions, 30 Replications, 
when D • 1.0 (OPT ■ .3085), U - U - (<(» )**Z 

n - 25 ^ ^ 


Number of Times 


Estimator 

Mean 

PMC 

Variance 

Improvement Over 
Estimator S 

PMC is Lower 
than That of 
Estimator S 
(max « 30) 

S 

.3978117 

.0011525 



K 

.3936133 

.0017926 

.0241985 

20 

C 

.3895865 

.0010908 

.0082253 

21 

R 

.3721815 

.0008544 

.0257103 

30 

D 

.3914080 

.0611284 

.0064038 

20 

M 

.3382653 

.0002799 

.0595464 

30 

A 

.3744258 

.0008205 

.0233860 

28 

P 

.3835509 

.0009451 

.0142668 

26 

0 

.3813010 

.0009768 

.0165107 

29 

F 

.3238669 

.0000759 

.0739449 

30 
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Table 3 


Comparison of Probabilities of Mlsclassiflcatlon for 

Several Discriminant Functions, 30 Replications, 

When D - 1.0 (OPT - .3085), 0- - U- - (« )7Z , 

. _ " “A P “P 


n 


10 


Estimator 

Mean 

PMC 

Variance 

Improvement Over 
Estimator S 

Number of Times 
PMC is Lower 
than That of 
Estimator S 
(max ■ 30) 

S 

.4349328 

.0018492 



K 

.4679930 

.0237833 

>.0336602 

11 

G 

.4999833 

.0000000 

-.0650505 

2 

R 

.4999707 

.0000000 

-.0650379 

2 

D 

.5086451 

.0000000 

-.0651123 

2 

M 

.4999633 

.0000000 

-.0650305 

2 

A 

.4999865 

.0000000 

-.0650538 

2 

P 

.4999697 

.0000000 

-.0650569 

2 

0 

.4999484 

.0000000 

-.0650156 

2 

F 

.4999921 

.0000000 

-.0650593 

2 


Table 4 

Comparison of Probabilities of Misclassification for 
Several Discriminant Functions, 30 Replications, 
when D - 1.0 (OPT - .3085), U, - U, ■ (i|» )*SZ , 

n - 25 -2 p -p 


Number of Times 


Estimator 

Mean 

PMC 

Variance 

Improvement Over - 
Estimator S 

PMC is Lower 
than That of 
Estimator S 
(max • 30) 

S 

.3956383 

.0012857 



K 

.4226880 

.0030092 

-.0270577 

2 

C 

.4998305 

.0000000 

-.1042082 

0 

R 

.4999249 

.0000000 

-.1042946 

0 

D 

.4999747 

.0000000 

-. 1043444 

0 

M 

.4999 791 

.0000000 

-.1043487 

0 

A 

.4999810 

.0000000 

-.1043587 

0 

P 

.4999485 

.0000000 

-.1043182 

0 

0 

.4999029 

.0000000 

-.1042726 

0 

F 

.4999881 

.0000000 

-.1043577 

0 
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' • a 1 


vortbuhile to eonsldor the variance of ^ ^ In confbination with the 

auignituda of the components of 3^ - X 2 ' tables 1 throng 4* •* X 2 

is an estimate of - £2 and Var(X^ - X 2 ) ■ £/(nj^ ■** n 2 >; but the magnitudes 
of the components of larger than the magnitudes of the corres- 
ponding components of ^*never is much larger than i|)^. This 

means that X, - X., When used to estimate U, - U. ■ has a greater 

—1 “Z P~P 

chance of being the sero vector and some compenents could change signs 
from sample to sample. 

The above observations suggest that the performance of a biasing 
procedure seems to be related to the ratio 


(Hi - H2 ^’(Hi - V 



(5.7) 


When this ratio is large, say greater than 1/p, as is the case in tables 
8-31, then biasing with a large tends to give good results. In tables 
32-55, note that the ratio in (5.7) becomes 1/p when D > 1 and increases 
(decreases) as D Increases (decreases). Since the simulations of this 
study did not focus on the ratio in (5.7) as a controlled condition, it 

is perhaps worth considering in a future study. 

2 2 

It is also worthwhile to note that when d^/4)^ > there is a 

tendency for the amount of improvement of the biased estimator over the 
unblsed one to increase as the k^'s get larger, as shown by column 4 of 
tables 8-31. Recall from section 5.2 that the biasing constants k^ in A 
and P differ only by the multiple (pf2)/(N-p-2) . When the sample size 
is 2n ■ N « 20, the value of k^ in P is larger than the corresponding k^ in 
A. When the sample size is 2n « N ■ 50, the reverse is true. This differ- 
ence in A and P is also reflected in the relative change in the magnitudes 


46 


of colunn 4 as the saaq>ls slse n changes froa 10 to 25. This observation 
In addition to the behavior of F provides svldanca that for certain poaltlons 
of ~ H 2 * <BK>unt of Inprovesient of a biased estlnator Increase as the 
k^'s get larger. 

The average PNC of a biased estimator may be coiqtared to the average 

PMC of the unbiased one through the application of the two sample t-test, 

t ^ ^ modification of the formula for a given 

2 

population value for D» let 8 ■ maximum variance of the sample PMC; then 
®2 — Hence, t^/ 2 ’^^^/'*^ serve as a conservative value to 

tdilch - X^ nay be compared. That Is, column 4 lists the difference 
between estimator S and all other biased estimators. If any value In this 
column that corresponds to a given biased estimator Is larger than 
ta/ 2 *^ 2 ^/*^> biased estimator gives results that are significantly 

different from that of S at level a. The critical values, C.V. , for the 
three values of D and a ■ .05 are as follows: when D ■ 0.6, C.V. ■ .0156; 

when D * 1.0, C. ■ .0226; when D ■ 3.0, C.V. ■ .0329. 


5.4 Coaeluaion 

This study has «itandsd and ganerallsad recant ptd>llshad work in tha 
area of biased astlMtlon It. discrinlnant analysis. Several Methods of 
biasing the sample linear dlscrlMlnant function have been described and 
compared on the basis of Monte Carlo experiments. The results of tha 
experiments show that no one method Is uniformly best for all configura- 
tions considered, aldiough D give a relatively poor performance In all 
situations studied. It is of special Interest to note that M, A, and F 
did well whenever the ratio in (5.7) was greater than 1/p. These methods 
are particularly effective for the sample size n ■ 10 In combination with 
(5.7) being larger than 1/p. The performance of K was erratic as can be 
seen by comparing its variance to the variance of other estimators. With 
some modification, R seems to have the potential to become a good biased 
procedure for cases where * **1+1* '****" n ■ 25 and dj^/tp^^ > 

F showed the largest positive values for column 4. As mentioned earlier 
and restated here, F Is equivalent to Ignoring the sample variance and 
covariance between the components of X by defining a discrlsilnant function 
where the Identity matrix replaces matrix S. In an applied situation, one 
can easily determine whether F is likely to outperform the standard 
unbiased function S by computing 




i-1 


where are the diagonal entries of matrix S. 
larger than 1/p, then F will probably do better 


(5.8) 


If this ratio Is much 
than S. 
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Finally t an axaalnation of tha alaulation raaults taena to aupport tha 
folloMlng ganaral conclualona: 

1. Tha Mthod of dalatlng tha anallaat alganvaluaa of tha aampla 
corralatlon natrlx glvaa ralativaly poorar parfonunca than tha othar biaaad 
procadura. 

2. Biaaad diacriminant functions labalad by M, A, and F (ana aaction 
5.2 for a dascription) parforsMd b^ttar than all otiiars when - D 2 
poaitionad ao that tha ith principal conponant contributaa nora to tha 
Hahalanobis distanca than tha (l*M)th principal conponant. 

3. Tha affact of snail aiganvaluas in S on biasing procadures dapands 
on tha poaitlot. of tha vactor “ U 2 * 

2 2 

4. When the orientation of - £2 ia such that d^/p^ > 

2 ? 2 

where D - 2, ^./p. is the square of Mahalanobia distancot all biasing 
1-1 ^ ^ 

nethods are particularly effective for snail saisplea. 

In applying Hoerl and Kennard's ridge regression nodel to practical 
problems, a general difficulty lies in the selection of an appropriate 
value for k. Sinilar difticulties exist in choosing a set of k^'s for tha 
biased discriailnant nodels proposed by this paper. However, based on tha 
sinulation results of this study, an applications orlcated user of discri* 
ninant analysis should use the results of an inspection of the following 
two items as an aid in deciding when a biased model should be used: 

1. Eigenvalues of matrix R where R is the sai^le correlation matrix. 

2. The ratio given by (5.8). 

If one or more eigenvalues of R are small, say less than .7, and if the 
ratio (5.8) is larger than 1/p, then it is worthwhile to proceed with the 
selection of a set of k^'s. That is, itens 1 and 2 provide evidence that 
biasing will improve the performance of the discrisdnant function. Given 
that an inspection of items 1 and 2 show that conditions are suitable for 
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biasing, the reconmendatlon here la to construct the unbiased discrimi- 
nant function along with several biased discriminant functions, say A, M, 


I 


f 

It 




K 

i 


I 

I 



I 


i; 

i: 



I 







0 


r 




i 


and F, where the k^'s (or these functions are defined in this section. The 
error rates for the unbiased as well as for the biased discriminant functions 
should be estimated by using one of the methods described in Lachenbruch 
(1975). The discriminant function to use would be the one which gives the 
smallest error rate. 

Lastly, any user should keep in mind that in .n practical situation, 
the efror rate of the population discriminant function is unknown and that 
the above taethod of choosing a discriminant function is simply an effort 
to choose the best classification model possible from the available data. 

The U method, as given by Lachenbruch (1975), of estimating error rates 
seems to be an efficient procedure in terms of using available data. Hence, 
this author rcconmcnds its use in estimating error rates in applied situa- 
tions where a choice is to be made between using one cf the biased discrimi- 
nant functions or the unbiased one. 

Results from this study raise the following questions that should 
merit further study: 

1. For biasing methods using k^, there is an optimum set of k^'s 
(perhaps not a unique set) for each problem, but no technique has been 
developed to compute them. 

2. Additional study is needed to determine how well each biased 
procedure introduced in this paper will perform in multiple group discrimi- 
nation In studying this proble, some consideration should be given the 
orientation of population mean vectors. 

3. Further siady is needed to assess the performance of both the 
two-group and the multi pie-group quadratic discriminant procedures under 
biasing conditions Introduced by this study. 
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APPENDIX A 


CALCUUTIONS LEADING TO THE EQUALITY 
FOR IN SECTION 2.2 

Ut W - (D^(X)|X^, S) - IX - Jj(x^ + X2)1’S“^(X^ - X^). where 
X^, and S are fixed. The condlcionaL probability of mieclaeeifica- 
tioii ubing D (X) is computed by 

PMC - »lCPg(l|2) + Pg(2|l)l , 

where 

P.(l|2) - Pr[W > 01 and P„(2ll) - Pr(W < 0] . 

Since U is a linear function of the components of the multivariate normal 
vector X, V is univariate normal with means and variance (2.15)-(2.17) . 
Hence, 

P (1(2) - PrlW > 01 . PrpL^ > -ZllH) " 

® Uvar(W)P (Var(W)P 

- Pr[Y > y^l , 

W - E(W) 

where Y • r is the univariate standard normal distribution, and 

[Var(W)P 

-E(W) " *‘^^1 " V 

Yi r ■ — z z ji Z7 z — z: — u~ • 

(Var(W)l^ '-1 “ -2^^^ 

By a similar calculation, 

P^(2|l) - Pr(Y < y^l 

where 
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Therefore, in general 


P dl2) • Pr(Y > 
s 


where 


APPENDIX B 


Show that, in general. (S -f K) 4 T*[K * K)T, «rtiere T la the auicrix 
of rlgenvcccors of S, T'T • TT* • I, A « • il®gonal matrix ao 

that i ^2 - *“ - elgenvaluea of S, K • ® 

general diagonal matrix, and S in a p^p symmetric matrix. 

It is clear that S -f K - T'lA KlT if • kj for I ^ J. let it 
be assumed tltat 4 k^ whenever 1 ^ j. Then, 

S + K - T’(A + K)T 
iff 

S + K - T’AT + T'KT 
iff 

S + K • S + T'KT 
iff 

K - T'KT 
iff 

TK - KT . 


Thus, it is sufficient to show that K and T are not generally pormutable. 
Theorem 3, page 223 of Gantmacher (I960) states that, "If two matrices A 
and D are permutable and one of them, say A, has quasi-diagonal form 


A - 


”l j '' 

J 

0 [Aj 




where matrices A^ and A^ do not have characierlstlc values in common, then 
the other matrix B must have the same quasi -diagonal form . . Using 
this theorem. It Is clear that since the general form of K requires that 
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Its diagonal elements are generally pairwise different, a necessary 
condition for permutability between K and T is that T be a diagonal matrix. 
However, T is not generally diagonal; tlterefore, in general, TK KT and 
hence (S K) T'(A K)T for the general diagonal matrix K. 
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APPENDIX C 


D 

D(X> 

D^(X) 

£j(X) 

S 


H 

“j 

OPT 


MOMEMCLATimE 

Nftlialaaobla distanca bctwaen tMO populations. 

Population discrlainant function. 

Conocie repcasentation for any biased discriminant function 
Unbiased sample discriminant function. 

A pKp diagonal Mtrlx with d^^ on the diagonal. 

The probability density function for the Jth population. 

The number of nonzero eigenvalues in matrix S . 

s 

The jth eigenvalue of matrix R. 

A nonnegative bias factor added to the Jth eigenvalue of 
matrix S. 

The jth eigenvalue of matrix S. 

The size of sample from jth population* 

Total optimum probability of miselassification. 

Standard normal cumulative distribution. 

The jth population 
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Pdlj) 


® 1*®2 


: — 

I 

I 

The probability of elasalfylng an obaervatlon Into lAien It 
la really from ir^ (1 ^ j). 

Probability of nlaclaaslfleatlon. 

The Jth eigenvalue of metrlx Z. 

The prior probability of obulnlng an observation from 
Sample correlation matrix. 

The region for classifying Into . 

Sample estimate of matrix Z. 

Generalised inverse of S (when S. Is singular). 

8 8 

Common covariance matrix. 

The Jth population covariance matrix. 

Positive values used to control multlcolllnearlty In Z. 

Sample estimate of -matrix Z^. 

Angle between Jth eigenvector of Z and vector - £ 2 . 

Total probability of mlsclasslflcatlon. 

The Jth population mean. 

Sample estimate of U^. 

Observation vector to be classified. 
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APPENDIX D 


COHTBOL FACTORS FOR SIMIUTIONS 

1. Sampla Sls«: N ■ lO* 25. 

2. MahaUnobis Diatanca: 0 - I(U^ - 
D " 0*69 XaOf 3«0* 

3* Severity of MultlcolUneerity: 

Hatrix 1 : ' 0 ^ ■ . 001 * ■ .001 

Matrix 2: ■ .001* 0 ^ ■ 1.00 

Matrix 3: ■ 10.00, ■ >001 

Matrix 4: 0 ^ • 10.00* ■ 1*00 

See tables 5 and 6 for eigenvalues of the correlation and covariance 
oa trices for the four data natrlces used. 


4. 


Orientation of (U^ • 
■atrlces. 

Orientation 1: 

Orientation 2: 

Orientation 3: 


£ 2 ) to eigenvectors of the four covariance 



vhere ^ ■ the Jth eigenvector of matrix £* 

“3 

■ the Jth eigenvalue of matrix £, 

D ■ Mahalanobls distance between ^ 2* 
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S«« tabl« 7 for specific values of cosO^t where 6^ la the angle between 
2^ and (Uj^ - Oj) . 
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Table 5 


Bigenvaluca of Population Correlation Hatrlees Uaed 


Eigen- 

values 


1 

1 

2 

3 

4 


1 

2.953026 

2.889975 

2.646488 

2.565580 


2 

1.710753 

1.686068 

1.416878 

1.410616 


3 

1.282204 

1.282135 

1.150556 

1.150228 


4 

1.070584 

1.070579 

1.061076 

1.061044 


3 

0.870809 

0.871681 

0.890215 

0.891328 


6 

0.834498 

0.835236 

0.846128 

0.846285 


7 

0.682985 

0.683049 

0.763410 

0.763500 


8 

0.594070 

0.594387 

0.631768 

0.631794 


9 

0.000971 

0.085915 

0.593380 

0.593710 


10 

0.000098 

0.000971 

0.000098 

0.085913 


Table 6 


Eigenvalues of Population Covariance Matrices Used 


Eigen- 

values 


1 

2 

3 

4 


1 

26.192363 

26.291875 

26.357706 

26.453878 


2 

17.175468 

17.224441 

17.226322 

17.280056 


3 

13.037192 

13.152136 

13.399986 

13.494534 


4 • 

12.107236 

12.107237 

12.340140 

12.351819 


5 

8.822387 

8.860483 

9.410080 

9.438447 


6 

7.934043 

7.954317 

8.620528 

8.636769 


7 

7.283433 

7.307305 

7.916816 

7.938362 


8 

5.959092 

5.963760 

7.280512 

7.303905 


9 

0.000970 

0.649324 

5.959092 

5.963760 


10 

0.000667 

0.000970 

0.000667 

0.649323 











Tabu 8 


Coaparlson of Probablllciat of Mlaclaasifleacioii for 
Savaral Dlaeriainanc Functions, 30 Rsplieaclons, 
whan D ■ .6 (OPT • *3821), Oriontaelon 1, 
Matrix 1, n ■ 10 


Eatlaator 

Mean 

PMC 

Variance 

loprovenent Over 
Estiaator S 

Munber of Tines 
PMC is Lower 
than That of 
Estiaator 8 
(aax ■ 30) 

S 

.4721895 

.0007165 

* 

e 

K 

.5004223 

.0182717 

-.0282328 

11 

G 

.4614415 

.0009447 

.0107480 

22 

& 

.4579530 

.0008195 

.0142365 

20 

D 

.4652601 

.0011444 

.0069294 

18 

M 

.4577994 

.0008638 

.0143960 

20 

A 

.4577805 

.0007817 

.0144090 

22 

P 

.4563847 

.0007113 

.0158047 

20 

0 

.4661020 

.0012684 

.0060875 

22 

F 

.4522470 

.0008839 

.0199425 

23 


Table 9 

Coaparison of Probabilities of Misclasslflcatlon for 
Several Dlscriainant Functions, 30 Replications, 
when 0 • .6 (OPT * .3821), Orientation 1, 
Matrix 1, n ■ 25 


Estiaator 

Mean 

PMC 

Variance 

laprovenent Over 
Estiaator S 

Hiaber of Tinas 
PMC is Lower 
than That of 
EstUator S 
(aax « 30) 

S 

.4621413 

.0005747 

* 

* 

K 

.4665843 

.0012835 

-.0044430 

15 

C 

.4594191 

.0007630 

.0027222 

19 

R 

.4350403 

.0007587 

.0071010 

23 

D 

.4597875 

.0007535 

.0023538 

17 

M 

.4467804 

.0007187 

.0153609 

26 

A 

.4553200 

.0008518 

.0068213 

20 

P 

.4578666 

.0007875 

.0042747 

18 

0 

.4570331 

.0007556 

.0051082 

21 

F 

.4394531 

.0006497 

.0226882 

28 
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T«bl« 10 


Coapariion of Probabillclaa of Misclaasifieatlon for 
Savaral Diterlr'i .mt Functions, 3u Raplicaciona* 
fihan D ■ .6 (OPT • ,3821), Orientation 1, 
Matrix 2, n • 10 


Bstiaator 

Mean 

PMC 

Variance 

Zaproveaent Over 
Estimator 8 

Munbar of Tines 
PMC is Lower 
than That of 
Eatlaator 8 
(max • 30) 

8 

.4731673 

.0006971 

* 

* 

X 

.5039592 

.0161971 

-.0327919 

11 

0 

.4616263 

.0008701 

.0115407 

22 

R 

.4576283 

.0006411 

.0155389 

23 

D 

.4700283 

.0013315 

.0031390 

16 

M 

.4375804 

.0006680 

.0155869 

21 

A 

.4574815 

.0007038 

.0156858 

23 

P 

.4558591 

.0006424 

.0173082 

22 

0 

.4655391 

.0010692 

.0076281 

21 

F 

.4533410 

.0009433 

.0196263 

23 


Table 11 

CoBparlsan of Probabilities of Hlsclassificatlon for 
Several Discriminant Functions, 30 Replications, 
triten 0 • .6 (OPT ■ .3821), Orientation 1, 
Matrix 2, a « 25 


Estimator 

Mean 

PMC 

Variance 

Improvement Over 
Estimator S 

Number of Tines 
PMC Is Lower 
chan That of 
Estimator S 
(max • 30) 

8 

.4619189 

.0006033 

* 

* 

X 

.4613094 

.0007568 

.0006095 

17 

c 

.4591505 

.0007032 

.0027684 

20 

E 

• .4553493 

.0007457 

.0065696 

22 

D 

.4597061 

.0007579 

.0022128 

17 

M 

.4486507 

.0006848 

.0132681 

23 

A 

.4558683 

.0009036 

.0060505 

20 

P 

.4578710 

.0008033 

.0040479 

21 

0 

.4570830 

.0007464 

.0048358 

22 

F 

.4412313 

.0006982 

.0206876 

27 
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Tabu 12 

.Coaparlaoa of Probabilitlaa of MlaeUsatfieatlon for 
Savaral Dlaerlminane Funeciona, 30 Raplieaciom, 
ohta D ■ 1.0 (OPT • .3005) • Orlaneaelen I, 
Haerlx 1, a ■ 10 


Eatiaator 

Maaa 

PMC 

▼arlaaca 

Zaprevaaaat Ovar 
Eatiaator S 

E*«bar of Tiaaa 
PMC la Lovar 
than That of 
Eatiaator 8 
(aax - 30) 

S 

.4352555 

.0019316 

a 

a 

K 

.4102568 

.00259528 

.0249987 

17 

C 

.4036046 

.0018006 

.0316509 

24 

E 

•4026084 

.0016065 

.0326471 

22 

D 

.4136102 

.0022986 

.0216454 

19 

M 

.4021568 

.0016737 

.0330987 

22 

A 

.3978281 

.0014702 

.0374275 

25 

P 

.3963853 

.0013612 

.0388703 

25 

0 

.4146123 

.0024739 

.0206432 

22 

P 

.3885997 

.0016683 

.0466558 

24 


Tabu 13 


Coaparlson of Probabilitlaa of KUeUaalficaeloa for 
Savaral Dlaerlmloaac Fuaetloaa, 30 Saplleacloaa. 
lOiaa 0 ■ 1.0 (OPT ■ .3085) . Orlaataclon 1* 
Matrix 1, a • 25 


I 

^ Eatiaator 


I 

i, 


Maaa 

PMC 


▼arianca 


Xaprovaaant Ovar 
Eatiaator S 


Muabar of Tiaaa 
PMC la Lover 
thaa That of 
Eatiaator S 
(aax • 30) 


I 



S 

.4021451 

.0013089 

a 

K 

.3973249 

.0019124 

.0048202 

C 

.3946661 

.0013566 

.0074790 

E 

.3846051 

.0011317 

.0175400 

D 

•3962315 

.0013599 

.0039136 

M 

.3723917 

.0007196 

.0297534 

A 

.3830196 

.0012260 

.0191255 

P 

.3897421 

.0012888 

.0124030 

0 

.3892117 

.0012410 

.0129334 

P 

.3619399 

.0004518 

.0402052 


19 

25 

27 
21 
25 
25 

25 

26 

28 
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T«bl« 14 


CoBptrlton of PrebobUltlM of Mlaelaosiflaicloii for 
Sovoral OlienalMoe Funetionf, 30 Roplieaelono, 
wh«n D • 1.0 (OPT ■ .3083), Orioneatlon 1, 
Hacrlx 2, a • 10 


BsclMtor 


Hoaa 

PMC 


Variaaeo 


Xaprovoaoat Over 
Batiaotor S 


lhanor of Tlaaa 
PMC la Lowar 
than That of 
Baciaator 8 


s 

.4337742 

.0018393 

* 

a 

K 

.4207499 

.0226436 

.0150244 

IS 

6 

.4027121 

.0016342 

.0330621 

26 

% 

.3996047 

.0012180 

.0361695 

22 

0 

.4187074 

.0026833 

.0170668 

18 

M 

.3988943 

.0012721 

.0368800 

23 

A 

.3961644 

.0013449 

.0396099 

27 

P 

.3943118 

.0012238 

.0414625 

23 

0 

.4133403 

.0021D5 

.0222340 

24 

F 

.3892398 

.0013826 

.0465143 

23 


Tabla 13 

CoBpariacn of Probablllelta of Mlaelaaaifleatlon for 
Savaral Dlaerialnanc Funcciona, 30 Raplieaclona, 
whan D - 1.0 (OPT - .3083), Oriantatioa 1, 
Matrix 2, n ■ 10 


Batlaator 


Ha an 

PMC 


.4012635 

.3955002 

•3944340 

.3842272 

.3935788 

.3743676 

.38391)5 

.3893937 

.3892706 

.3642246 


Tarjtanea 


.0013480 

.0015308 

.0012847 

.0010717 

.0013032 

.0007243 

.0012346 

.001265^ 

.0012074 

.0003034 


Xaprovaaaat Ovar 
Batlaator 8 


Rtalbar of Tlaaa 
PMC la lowar 
chan That of 
Baciaator S 
(aax ■ 30) 


.0057632 

.0068093 

.0170362 

.0056846 

.0268939 

.0173500 

.0116697 

.0119929 

.0370389 


1 



Tabu 16 


Conparison of Probabilities of Misclaesifieatlon for 
Several Discriminant Functions, 30 Replications, 
when 0 ■ 3.0 (OPT > ;0668) , Orientation 1, 
Matrix 2, n • 10 

Estimator 

Mean 

PMC 

Variance 

Ii^irovement Over 
Estimator .S 

Mtsaber of Times 
PMC is Lover 
than That of 
Estimator S 
(max > 30) 

S 

.2532152 

.0085259 

* 

* 

K 

.1302365 

.0126336 

.1229787 

27 

6 

.1368733 

.0016761 

.1163419 

30 

R 

.1323025 

.0011064 

.1209127 

28 

D 

.1745411 

.0024220 

.0786741 

22 

M 

.1312406 

.0010598 

.1219746 

28 

A 

.1241410 

.0011634 

.1290742 

30 

P 

.1216026 

.0009846 

.1316126 

29 

0 

.1634272 

.0029187 

.0897879 

25 

P 

.1094501 

.0002702 

.1437650 

29 


Table 17 

Comparison of Probabilities of Misclasslflcatlon for 
Several Discriminant Functions, 30 Replications, 
irtien D ■ 3.0 (OPT » .0668), Orientation 1, 
Matrix 2, n « 25 


Estimator 

Mean 

PMC 

Variance 

Improvement Over 
Estimator S 

Number of Times 
PMC is Lower 
than That of 
Estimator S 
(max • 30) 

S 

.1339836 

.0019074 

* 

* 

K 

.1106339 

.0013456 

.0233497 

29 

C 

.1156332 

.0013953 

.0153504 

27 

R 

.1026748 

.0006175 

.0313068 

28 

D 

.1225503 

.0015961 

.0114333 

23 

M 

.0982620 

.0002503 

.0357216 

26 

A 

.0981882 

.0005615 

.0357954 

29 

P 

.1082184 

.0008354 

.0257652 

28 

0 

.1099456 

.0008859 

.0240381 

28 

F 

.0965553 

.0001367 

.0354283 

26 
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Table 18 


Conparlson of Probabilities of Mlsclassif Ication for 
Several Discriminant Functions, 30 Replications, 
when D • 3.0 (OPT • .0668), Orientation 1, 
Matrix 2, n ■ 10 


Estimator 

Mean 

PMC 

Variance 

Improvement Over 
Estimator S 

Humber of Times 
PMC is Lower 
than That of 
Estimator S 
(max ■ 30) 

S 

.2565326 

.0080737 

* 

* 

R 

.1228519 . 

.0102818 

.1336807 

27 

G 

.1344490 

.0015525 

.1220836 

30 

R 

.1285779 

.0007489 

.1279547 

29 

D 

.1761069 

.0032892 

.0804257 

25 

M 

.1285241 

.0007831 

.1280085 

28 

A 

.1210284 

.0010253 

.1355042 

30 

P 

.1180248 

.0008606 

.1385078 

30 

0 

.1618516 

.0024774 

.0946810 

30 

F 

.1697688 

.0003251 

.1467638 

30 


Table 19 

Comparison of Probabilities of Mlsclassif ication for 
Several Discriminant Functions, 30 Replications, 
when D • 3.0 (OPT - .0668) , Orientation 1, 
Matrix 2, n • 25 


Estimator 

Mean 

PMC 

Variance 

Improvement Over 
Estimator S 

Number of Times 
PMC is Lower 
than That of 
Estimator S 
(max ■ 30) 

S 

.1317634 

.0017680 

* 

* 

K 

.1180470 

.0015949 

.0137164 

30 

G 

.1193976 

.0012774 

.0123658 

28 

R 

.1007304 

.0004446 

.0310330 

30 

D 

.1210195 

.0012470 

.0107439 

21 

M 

.0962238 

.0002184 

.0355395 

27 

A 

.0982682 

.0004192 

.0334952 

29 

P 

.1076276 

.0006695 

.0241358 

28 

0 

.1095399 

.0007521 

.0222235 

28 

F 

.0989428 

.0021467 

.0328205 

23 
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Table 20 


‘U. 

Hi 




Comparison of Probabilities of Misclassif ication for 
Several Discriminant Functions, 30 Replications, 



when D • 

.6 (OPT ■ .3821), Orientation 1 
Matrix 3, n • 10 

• 

Estimator 

Mean 

PMC 

Variance 

Improvement Over 
Estimator S 

Number of Times 
^C is Lover 
than That of 
Estimator S 
(max - 30) 

S 

.4733989 

.0006782 

* 

* 

K 

.4971384 

.0145314 

-.0237395 

10 

G 

.4626321 

.0008694 

.0107668 

21 

R 

.4553495 

.0009316 

.0180494 

21 

D 

.4658064 

.0011544 

.0075925 

18 

M 

.4552476 

.0009972 

.0181514 

21 

A 

.4576955 

.0007795 

.0157034 

22 

P 

.4559194 

.0007378 

.0174795 

22 

0 

4680424 

.0010460 

.0053565 

19 

F 

.4528751 

.0009568 

.0205238 

23 


Table 21 

Comparison of Probabilities of Misclassif ication for 
Several Discriminant Functions, 30 Replications, 
vhen D - .6 (OPT ■ .3821), Orientation 1, 
Matrix 3, n • 2S 


Estimator 


Mean 

PMC 


Variance 


Improvement Over 
Estimator S 


Number of Times 
PMC is Lower 
than That of 
Estimator S 
(max • 30) 


. i 

s 

.4619080 

.0005039 

* 

* 

: j 

K 

.4668184 

.0011250 

-.0049104 

15 

' 

G 

.4592696 

.0006578 

.0026385 

20 

* M 

R 

.4531585 

.0006627 

.008749 5 

24 


0 

.4602147 

.0006546 

.0016933 

17 

1 ^ 

M 

.4418609 

.0007154 

.0200471 

23 


A 

.4531532 

.0007265 

.0087548 

23 

) 

P 

.4568855 

.0006745 

.0050225 

21 

r i ^ 

0 

.4566850 

.0006546 

.0052231 

22 

r i 

' If 

F 

.4379801 

.0005992 

.0239279 

27 




T«bl« 22 





Cottparison of Probabllieles of Mloc lass if loot Ion for 
Savaral Dlacrlainane Functions, 30 Rapll cations, 
when D • .6 (OPT ■ .3821), Orientation I, 
Matrix 3, n ■ 10 


Estlaator 

Mean 

PMC 

Variance 

Iaq)rovement Over 
Estimator S 

Humber of Times 
PMC Is Lower 
than That of 
Estimator S 
(max ■ 30) 

S 

.4738649 

.0006150 

* 

* 

K 

.5022683 

.0136387 

-.0284034 

10 

6 

.4622541 

.0007822 

.0116107 

21 

R 

.4556584 

.0007593 

.0182065 

23 

D 

.4675753 

.0012553 

.0062896 

19 

M 

.4548204 

.0007921 

.0190445 

23 

A 

.4575199 

.0007348 

.0163449 

22 

P 

.4556534 

.COO 7061 

.0182U4 

24 

0 

.4671534 

.0008232 

.0067114 

20 

F 

.4543066 

.0010736 

.0195582 

25 




Table 23 


Conparlson of Probabilities of Misclassificatlon for 
Several Dlscrlalnant Functions, 30 Replications, 
when D « .6 (OPT - .3821), Orientation 1, 
Matrix 3, n > 25 


Estimator 

Mean 

PMC 

Variance 

Improvement Over 
Estimator S 

Number of Times 
PMC is Lower 
than That of 
Estlaator S 
(max • 30) 

S 

.4620522 

.0005445 

* 

* 

K 

.4624846 

.0007348 

-.0004324 

16 

G 

.4596430 

.0006425 

.0024092 

21 

R 

.4533U7 

.0006838 

.0087405 

24 

D 

.4610649 

.0006666 

.0009872 

18 

M 

.4417442 

.0006493 

.0203080 

27 

A 

.4533371 

.0007548 

.0087151 

23 

P 

.4573193 

.0007006 

.0047329 

20 

0 

.4576000 

.0006637 

.0044522 

23 

F 

.4398371 

.0006321 

.0222151 

26 
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Tabu 24 
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Comparison of Probablllclaa of Misc lass If lea cion for 
Savaral Dlscrlmlnanc Functions • 30 Rapllcatlons, 
whan D ■ 1.0 (OPT - .3083), Orientation 1, 
Matrix 3, n • 10 


Estimator 

Mean 

PMC 

Variance 

Improvement Over 
Estimator S 

Number of Times 
PMC Is Lower 
than That of 
Estimator S 
(max > 30) 

S 

.4357952 

.0018402 

* 

* 

K 

.4189812 

.0200049 

.0168140 

17 

G 

.4071377 

.0017119 

.0286575 

24 

R 

.3964953 

.0018037 

.0392999 

22 

D 

.4145193 

.0021565 

.0212759 

20 

M 

.3958199 

.0019141 

.0399753 

22 

A 

.3988230 

.0015270 

.0369722 

25. 

P 

.3962343 

.0014592 

.0395609 

24 

0 

.3192497 

.0021358 

.0165455 

22 

F 

.3888942 

.0017892 

.0469010 

24 


Table 25 

Comparison of Probabilities of Mlsclasslflcatlon for 
Several Discriminant Functions, 30 Replications, 
when D * 1.0 (OPT <■ .3085), Orientation 1, 
Matrix 3, n • 10 

Estimator 

Mean 

PMC 

Variance 

Improvement Over 
Estimator S 

Humber of Times 
PMC Is Lower 
chan That of 
Estimator S 
(max <■ 30) 

S 

.4018569 

.0010668 

* 

* 

K 

.4007859 

.0015441 

.0010709 

19 

G 

.3951770 

.0010529 

.0066799 

24 

R 

.3814448 

.0008253 

.0204121 

27 

D 

.3980616 

.0010812 

.0037952 

17 

M 

.3637072 

.0005564 

.0381497 

29 

A 

.3802136 

.0009138 

.0216433 

28 

P 

.3890553 

.0009586 

.0128015 

27 

0 

.3892713 

.0009437 

.0125866 

27 

F 

.3591038 

.0003616 

.0427531 

29 


i 


I 
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Table 26 


tt 


Comparison of Probabiliclea of Mis classification for 
Several Discriminant Functions. 30 Replications, 
vhen D ■ 1.0 (OPT .3085), Orientation 1, 

Hatrlx 4, n - 10 


Estimator 

Mean 

PMC 

Variance 

Improvement Over 
Estimator S 

Number of Times 
PMC Is Lower 
than That of 
Estimator S 
(max • 30) 

S 

.4354159 

.0017285 

* 

* 

K 

.4251562 

.0195306 

.0102597 

16 

G 

.4057336 

.0015603 

.0296823 

24 

R 

.3958520 

.0013321 

.0395640 

25 

•D 

.4158819 

.0024094 

.0195341 

19 

M 

.394U98 

.0013734 

.0412961 

23 

A 

.3973010 

.0014017 

.0381150 

25 

P 

.3943527 

.0013213 

.0410632 

25 

0 

.4184096 

.0017654 

.0170063 

23 

F 

.3901320 

.0018553 

.0452840 

23 


I _ 


Table 27 


Comparison of Probabilities of Miaclassification for 
Several Discriminant Functions, 30 Replications, 
when D - 1.0 (OPT - .3085), Orientation 1, 
Hatrlx 4, n ■ 25 


Estimator 


Mean 

PMC 


Variance 


Improvement Over 
Estimator S 


Number of Times 
PMC Is Lower 
than That of 
Estimator S 
(max - 30) 


S 

R 

G 

R 

0 

M 

A 

P 

0 

F 


.4021550 

.3981924 

.3956099 

.3815127 

.3985176 

.3631856 

.3805082 

.3893249 

.3907515 

.3616395 


.0011225 

.0013287 

.0010401 

.0007640 

.0010365 

.0004710 

.0009086 

.0009615 

.0009609 

.0004015 


.0039626 

.0065451 

.0206423 

.0036374 

.0389694 

.0216468 

.0128301 

.0114035 

.0405156 


24 

26 

27 
20 
30 

28 
27 
27 
29 
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A, 



Table 28 




1 . 


Comparison of Probabilities of Mlsclasslfleaelon for 
Several Discriminant Functions, 30 Replications, 
When D ■ 3.0 (OPT • .0668), Orientation 1, 


I 


I • 1, 


! 


Matrix 3, 

9 

s 

o 


1 Estimator 

Mean 

PMC 

Variance 

Improvement Over 
Estimator S 

Number of Times 
PMC is Lower 
than That of 
Estimator S 
(max • 30) 

1. s . 

.2535330 

.0087476 

* 

* 

K 

.1433302 

.0166587 

.1102027 

26 

), G 

.1390474 

.0020212 

.U44855 

30 

1 R 

.1217244 

.0009741 

.1318086 

28 

D 

.1678301 

.0022194 

.0857029 

23 

M 

.1198355 

.0008380 

.1336974 

29 

! A 

.1243295 

.0013497 

.1292035 

20 

p 

.1209760 

.001U45 

.1325570 

29 

0 

.1668324 

.0036922 

.0867086 

29 

i F 

.1062285 

.0002769 

.1473044 

29 


Table 29 

Conparlson of Probabilities of Mlsclassification for 
Several Discriminant Functions, 30 Replications, 
vhen D ■ 3.0 (OPT • .0668), Orientation 1, 
Hatrlx 3, n • 25 


Estimator 


Kean 

PMC 


Variance 


lotprovement Over 
Estimator S 


Number of Times 
PMC is Lower 
than That of 
Estimator S 
(max ■ 30) 


• f ' 

s 

.1333216 

.0017915 

* 

* 


K 

.U92148 

.0013915 

.0141068 

28 

l 

G 

.U98772 

.0012919 

.0134444 

29 


R 

.0998444 

.0004592 

.0334772 

30 

■ ]' 

D 

.1223266 

.0014919 

.0109950 

25 

: ( 

M 

.0922712 

.0001474 

.0410504 

28 


A 

.0968348 

.0004249 

.0364867 

28 

■' \ ' 

P 

.1081441 

.0007065 

.0251775 

29 

■ L 

0 

.1102384 

.0007987 

.0230832 

28 


F 

.0944551 

.0001096 

.0388664 

25 





c 

I 

L 
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TabU 30 

Coaparison of Probablllclat of Mlsclasslflcacion for 
Several Discrlalnanc Funeciona, 30 Repllcaeiona, 
when D ■ 3.0 (OPT > .0668) , Orientation 1* 
Matrix 4, n * 10 


Estlnator 

Mean 

PMC 

Variance 

Improvement Over 
Estimator S 

Musber of Times 
PMC la Lower 
than That of 
Estimator S 
(max ■ 30) 

S 

.2558041 

.0089398 

* 

* 

K 

.1336960 

.0123173 

.1221051 

28 

G 

.1333707 

.0016500 

.1224334 

30 

R 

.1214726 

.0005824 

.1343315 

29 

D 

.1590654 

.0025688 

.0967387 

26 

M 

.1208897 

.0005408 

.1349144 

28 

A 

.1200201 

.0011033 

.1357840 

30 

P 

.1166758 

.0009311 

.1391283 

30 

0 

.1671048 

.0032112 

.0886993 

30 

F 

.1070920 

.0003425 

.1487121 

29 


Table 31 




Comparison of Probabilities of Mlsclasslflcatlon for 
Several Discriminant Functions, 30 Replications, 
when D - 3.0 (OPT - .0668), Orientation 1, 
Matrix 4, n * 25 

Estimator 

Mean 

PMC 

Variance 

Improvement Over 
Estimator S 

Number of Times 
PMC Is Lover 
than That of 
Estimator S 
(max ■ 30) 

S 

.1328329 

.0015465 

* 

* 

K 

.1205019 

.0011633 

.0123709 

30 

C 

.1196864 

.0009976 

.0131464 

30 

R 

.0983347 

.0002869 

.0344982 

29 

D 

.1208810 

.0010090 

.0119519 

21 

M 

.0907088 

.0000900 

.0421240 

29 

A 

.0954688 

.0002912 

.0373640 

29 

P 

.1072921 

.0005331 

.0255408 

30 

0 

.1113893 

.0006753 

.0214436 

30 

F 

.0951386 

.0001210 

.0376943 

25 
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Tablt 32 


Comparison of Probabillcies of Mlsclasalf Icaelon for 
Several Dlscrlmlnanc Funccions, 30 Replications, 
when 0 ■ .6 (OPT ■ .3821), Orientation 2, 
Matrix 1, n ■ 10 


Estimator 

Mean 

PMC 

Variance 

Improvement Over 
Estimator S 

Number of Times 
PMC is Lower 
Chan That of 
Estimator S 
(max ■ 30) 

S 

.4748426 

.0009860 

* 

* 

K 

.5155888 

.0148181 

-.0407462 

8 

G 

.4696251 

.0007778 

.0052174 

18 

R 

.4675198 

.0006312 

.0073228 

17 

D 

.4732984 

.0008981 

.0015442 

17 

M 

.4677484 

.0006629 

.0070941 

17 

A 

.4672965 

.0006134 

.0075461 

14 

P 

.4664045 

.0005490 

.0084381 

15 

0 

.4731800 

.0010791 

.0016625 

18 

F 

.4654012 

.0006399 

.0094413 

20 


Table 33 

Comparison of Probabilities of Misclasslflcation for 
Several Discriminant Funccions, 30 Replications, 
when D ■ .6 (OPT ■ .3821), Orientation 2, 
Matrix 1, n • 25 


Estimator 

Mean 

PMC 

Variance 

Improvement Over 
Estimator S 

Number of Times 
PMC is Lower 
than That of 
Estimator S 
(max • 30) 

S 

.4627196 

.0006670 

* 

* 

K 

.4719242 

.0014018 

-.0092046 

10 

G 

.4694026 

.0009036 

-.0066830 

6 

R 

.4668919 

.0009133 

-.0041723 

14 

D 

.4695432 

.0008943 

-.0068236 

6 

M 

.4616642 

.0008998 

.0010555 

18 

A 

.4676708 

.0010138 

-.0049511 

12 

P 

.4687074 

.0009316 

-.0059878 

9 

0 

.4679746 

.0009010 

-.0052550 

10 

F 

.4581825 

.0008614 

.0045371 

17 
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T«blt 34 


Conp«rlson of Probabillcios of HloeUtsiflcatlon for 
Sovoral Dlserlaliuac Funetiono, 30 Roplleatlona , 
whon D ■ *6 (OPT - .3821), Orimtatlon 2, 
^trlx 2, n ■ 10 


Estlauitor 

Mean 

PMC 

Variance 

Improvement Over 
Estimator 8 

RWber of Timas 
PMC Is Lower 
than That of 
Estimator S 
(max ■ 30) 

8 

.4748407 

.0009862 

* 

* 

K 

.5124921 

.0142417 

-.0376514 

9 

G 

.4689430 

.0007691 

.0058977 

17 

R 

.4666333 

.0004998 

.0082074 

18 

D 

.4777U8 

.0010373 

-.0028711 

15 

M 

.4670981 

.0005090 

.0077425 

17 

A 

.4665159 

.0005723 

.0083248 

14 

P 

.4655785 

.0005018 

.0092622 

15 

0 

.4713298 

.0010116 

.0035109 

19 

P 

.4658244 

.0006795 

.0090163 

17 


Table 35 

CoBparlsoa of Probabilities of Misclasslflcatlm for 
Several Diserlffllnaac Functions, 30 Replications, 
when D • .6 (OPT - .3821) , Orientation 2, 
Hatrix 2, n • 25 


Estimator 

Mhan 

PMC 

Variance 

Improvement Over 
Estimator 8 

Humber of Times 
PMC la Lower 
than That of 
Eatlmator 8 
(max • 30) 

S 

.4627164 

.0006672 

* 

* 

K 

.4644286 

.0007672 

-.0017121 

12 

C 

.4660719 

.0007766 

-.0033555 

8 

R 

.4654612 

.0008201 

-.0027447 

13 

D 

.4692930 

.0008660 

-.0065766 

6 

M 

.4625991 

.0007796 

.0001173 

U 

A 

.4668268 

.0009913 

-.0041104 

U 

P 

.4669938 

.0008796 

-.0042774 

U 

0 

.3657500 

.0008180 

-.0030336 

12 

F 

.4591060 

.0008321 

.0036104 

17 
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Table 36 


c 


1C 

II 

* 

\ 

;c 

r 

* w- . 

j: 

f ;c 


Coaparlson of Probabllleiaa of Mlsclassiflcaeloii for 
Savaral DiseriBinane Functions, 30 Rapllcationa, 
whan D ■ 1.0 (OPT ■ .3085), Orientation 2« 
Matrix 1, n • 10 


Estlaator 

Mean 

PMC 

Variance 

laproveaent Over 
Estimator S 

tluaber of Times 
PMC Is Lower 
than That of 
Estimator S 
(max ■ 30) 

S 

V 

.4353746 

.0015345 

* 

e 

Ih 

.4330599 

.0203526 

.0023147 

14 

V 

o 

.4219073 

.0014967 

.0134672 

18 

R 

Tb 

.4226489 

.0013245 

.0127256 

18 

U 

M 

.4308677 

.0018493 

.0045069 

16 

n 

A 

.4230547 

.0013879 

.0123198 

18 

A 

p 

.4179050 

.00U787 

.0174696 

17 

o 

.4171228 

.0010810 

.0182518 

21 

V 

.4311192 

.0021821 

.0042554 

18 


.4151206 

.0012979 

.0202540 

22 


I / 


Table 37 

Conparlson of Probabilities of Mlsclasslflcation for 
Several Diserlalnant Functions, 30 Replications, 
when 0 • 1.0 (OPT • .3085), Orientation 2, 
Matrix 1, n ■ 25 


Estisator 


Mean 

PMC 


Variance 


laprovenent Over 
Estlaator S 


Mueber of Times 
IMC Is Lower 
dian That of 
Estlaator S 
(max ■ 30) 


S 

.4032747 

.0014401 

e 

e 

K 

.4051522 

.0022599 

-.0018774 

16 

G 

.4152298 

.0017743 

-.0119551 

9 

R 

.4079954 

.0015654 

-.0047206 

12 

D 

.4164404 

.0017685 

-.0131657 

9 

M 

.3989645 

.0011283 

.0043102 

16 

A 

.4069522 

.0016618 

-.0036775 

14 

P 

.4U7239 

.0017241 

-.0084492 

12 

0 

.4112504 

.0016739 

-.0079757 

11 

F 

.3917771 

.0008223 

.011497/ 

17 




c 


Tablt 38 





I CoBpariton of Probabllitloi of Miaclataifleacioii for j 
i { Sevtral Diacrlalnanc Functions, 30 Rapllcaelona, | 
' (_ whan D ■ 1.0 (OPT ■ .3085), Orientation 2, 

I Matrix 2, n - 10 


Eatiaator 

Mean 

PMC 

Variance 

Zaproveaent Over 
Eatiaator 8 

Huaber of Tlaes 
PMC is Lower 
than That of 
Estlaacor 8 
(aax - 30) 

S 

.4353760 

.0015353 

• 

* 

K 

.4288924 

.0192817 

.0064836 

IS 

G 

.4197841 

.0014778 

.0155919 

19 

R 

.4194766 

.0010147 

.0158995 

19 

D 

.4386382 

.0021675 

-.0032621 

13 

M 

.4196677 

.0010474 

.0157083 

19 

A 

.4154792 

.0011159 

.0198969 

18 

P 

.4145854 

.0009961 

.0207906 

20 

0 

.4278291 

.0020048 

.0075470 

18 

F 

.4149475 

.0013041 

.0204286 

22 


i 

j ' 




i 


Tabu 39 


Comparison of Probabilities of Misclassification for 
Several Discriminant Functions, 30 Replications, 
irtien D ■ 1.0 (OPT - .3085), Orientation 2, 
Matrix 2, n ■ 25 


Eatiaator 

Mean 

PMC 

Variance 

laprovement Over 
Estimator S 

NUaber of Tinea 
PMC is Lover 
than That of 
Eatiaator S 
(aax • 30) 

8 

.4032655 

.0014404 

* 

* 

K 

.4003054 

.0016533 

.0029601 

18 

G 

.4080359 

.0015667 

-.0047704 

9 

R 

.4042938 

.0013951 

-.0010283 

14 

D 

.4158558 

.0017483 

-.0125903 

9 

M 

.3998658 

.0009956 

.0033997 

12 

A 

.4057463 

.0016503 

-.0024808 

13 

P 

.4079364 

.0016128 

-.0046709 

13 

0 

.4062713 

.0015187 

-.0030058 

14 

F 

.3935481 

.0008284 

.0097173 

14 


i 


Tabu 40 


Cnparlaen of Probabillelaa of MiaeUaalfieacion for 
Savaral Dlacrlminaac Fttneclonst 30 Raplleatiaaa* 
«han D ■ 3.0 (OPT ■ .0668), Oriancation 2, 
tfacrix 1, a ■ 10 


Eatlaator 

Maaa 

PMC 

Varlaaea 

ZapravaoMat Ovar 
Eatlaator 8 

MWbar of Tiaaa 
PMC la Lowar 
thaa That of 
Eatlaator 8 
(aax ■ 30) 

8 

.2493078 

.0060483 

a 

a 

K 

.1123621 

.0066941 

.1371437 

29 

6 

.1692414 

.0020379 

.0802664 

27 

R 

.1709730 

.0013073 

.0783348 

26 

D 

.2103048 

.0029329 

.0390030 

20 

M 

.1707136 

.0013022 

.0787921 

27 

A 

.1577432 

.0014461 

.0917623 

26 

P 

.1366116 

.0013031 

.0928962 

26 

0 

.1952627 

.0031342 

.0342430 

23 

F 

.1314887 

.0007328 

.0980191 

27 


Tabla 41 

Cooparlton of Probabllltlos of Mlselaaaifleaeloa for 
Savaral Dlserlalnaae Funetloaa, 30 Saplleationa. 
iriian D ■ 3.0 (OPT • .0668) , Orimcaclon 2, 
Matrix 1, n ■ 23 


latlaator 


Maaa 

PMC 


Varlaaea 


laprovManc Ovar 
Eatlaacor 8 


Kuiftar of Tinas 
PMC la Lowar 
than That of 
Eatlaator 8 
(nax • 30) 


.1331733 

.00:9357 

a 

.U44944 

.0012634 

.0186789 

.1477342 

.0016095 

-.0143809 

.1316691 

.00078U 

.0013042 

.1322981 

.0018516 

-.0191248 

.1279312 

.0004230 

.0052221 

.1253118 

.0006375 

.0076613 

.1364622 

.0009679 

-.0032889 

.1390306 

.0010629 

-.0058573 

.1303412 

.0002908 

.0028321 


22 

7 

13 

6 

13 

13 

13 

12 

12 
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T«bl« A2 


Coapariton of Probabilities of Misclataifieaelon for 
Several Discriainanc Functions, 30 Replica tioaa, 
when D ■ 3.0 (OPT ■ .0668), Orientation 2, 
Matrix 2, n ■ 10 


Eeciaator 


Mean 

PMC 


Variance 


laprovenent Over 
Eeciaator S 


Nuaber of Tiaes 
PMC is Lower 
than That of 
Eatlaetor 8 
(aax ■ 30/ 


s 

.2495055 

.0060521 

* 

K 

.1078930 

.0062406 

.1416125 

C 

.1632755 

.0019554 

.0862300 

R 

.1633573 

.0011703 

.0861462 

0 

.2102215 

.0030642 

.0392840 

M 

.1644306 

.0012311 

.0850749 

A 

.1523755 

.0013742 

.0971300 

P 

.1512669 

.0011787 

.0982386 

0 

.1874490 

.0028035 

.0620565 

F 

.1493153 

.0008663 

.1001900 


29 

27 

27 

19 

27 

27 

27 

27 

27 


Table 43 

Comparison of Probabilities of Misclasslfication for 
Several Discrialnant Functions. 30 Replications, 
when D - 3.0 (OPT - .0668), Orientation 2, 
Matrix 2, n • 23 


Estimator 


Mean 

PMC 


Variance 


Xaproveaent Over 
Estimator S 


fluBbei of Times 
PMC is Lover 
than That of 
Estimator S 
(aax > 30) 


S 

K 

C 

E 

0 

M 

A 

p 

0 

F 


1331843 

.0019554 

1195333 

.0016524 

1364841 

.0016179 

1232534 

.0005845 

1508638 

.0016469 

1247775 

.0003938 

1222640 

.0004822 

1291367 

.0007964 

129 7228 

.0009705 

1297631 

.0002820 


* 

* 

.0136510 

25 

.0032998 

12 

.0099309 

15 

0 1 76795 

6 

,0084068 

15 

0109203 

16 

0040476 

15 

0034615 

16 

0024212 

U 


78 




S 

.4748412 

.0009861 

* 

* 

! . 

K 

.5065296 

.0128271 

-.0316884 

8 

. 

G 

.4673149 

.0007657 

.0075263 

16 

\ 

j 

R 

.4629258 

.0007699 

.0119154 

19 

1 . 

0 

.4712337 

.0010013 

.0036075 

17 


M 

.4634376 

.0008083 

.0114036 

21 

j ■ ’ 
! 

A 

.4645034 

.0006641 

.0103378 

16 


P 

.4637172 

.0006158 

.0111240 

17 


0 

.4711998 

.0009542 

.0036414 

17 

1 

F 

.4638498 

.0007231 

.0109914 

22 


Table 45 

Cooparlson of Probabilities of Mis classification for 
Several Discriminant Functions, 30 Replications, 
«^en D * .6 (OPT - .3821), Orientation 2, 
Matrix 3, n • 25 


Estimator 


Mean 

PMC 


Variance 


Improvement Over 
Estimator S 


Number of Times 
PMC is Lower 
than That of 
Estimator S 
(max ■ 30) 




Table 46 


Comparison of Probabillclea of Misclaasifleaeion for 
Several Dlscrimlnane Functions, 30 Replleationa, 
when D - .6 (OPT > .3821), Orientation 2, 

Matrix 4, n ■ 10 


Estimator 

Mean 

PMC 

Variance 

Improvement Over 
Estimator S 

Number of Times 
PMC is Lower 
than That of 
Estimator S 
(max • 30) 

S 

.4748412 

.0009861 

* 

• 

* 

K 

.5056651 

.0125282 

-.0308240 

9 

6 

.4662255 

.0006887 

.0086157 

18 

R 

.4616149 

.0006204 

.0132263 

20 

D 

.4720771 

.0010743 

.0027641 

17 

M 

.4614755 

.0006246 

.0133657 

20 

A 

.4635341 

.0006006 

.0113071 

18 

P 

.4627018 

.0005589 

.0121394 

18 

0 

.4695581 

.0008809 

.0052831 

18 

P 

.4641099 

.0007820 

.0107313 

20 


Table 47 

Comparison of Probabilities of Mis classification for 
Several Discriminant Functions, 30 Replications, 
when D - .6 (OPT » .3821), Orientation 2, 

Matrix 4, n - 25 


Estimator 

Mean 

PMC 

Variance 

Improvement Over 
Estimator S 

Number of Times 
PMC is Lower 
than That of 
Estimator S 
(max - 30) 

S 

.4627165 

.0006672 

* 

* 

E 

.4640364 

.000867? 

-.0013198 

13 

G 

.4630044 

.0007643 

-.0002878 

15 

R 

.4599910 

.0008376 

.0027255 

17 

D 

.4668319 

.0008721 

-.0041153 

9 

M 

.4538565 

.0008490 

.0088601 

19 

A 

.4613622 

.0009365 

.0013543 

14 

P 

.4629706 

.0008562 

-.0002540 

16 

0 

.46 2409 7 

.0007983 

.0003069 

16 

F 

.4550720 

.0008097 

.0076445 

19 


Table 48 


■Comparison of Probabilities of Misclasslfieation for 
Several Discriminant Functions, 30 Replications, 
when D • 1.0 (OPT - .3085), Orientation 2, 
Matrix 3, n - 10 


Estimator 

Mean 

PMC 

Variance 

Improvement Over 
Estimator S 

Number of Times 
PMC is Lower 
than That of 
Estimator S 
(max > 30) 

S 

.4353813 

.0015353 

* 

* 

K 

.4351904 

.0161037 

.0001908 

15 

G 

.4179437 

.0014845 

.0174375 

22 

R 

.4120051 

.0015629 

.023 3762 

21 

D 

.4263450 

.0019593 

.0090362 

16 

M 

.4125457 

.0016542 

.0228355 

21 

A 

.4127833 

.0012942 

.0225979 

19 

P 

.4116455 

.0012239 

.0237358 

20 

0 

.4277025 

.0019486 

.0076788 

18 

F 

.4107671 

.0014623 

.0246141 

22 


Table 49 

Comparison of Probabilities of Misclasslfieation for 
Several Discriminant Functions, 30 Replications, 
when D - 1.0 (OPT - .3085), Orientation 2, 
Matrix 3, n - 25 


Estimator 


Mean 

PMC 


Variance 


Improvement Over 
Estimator S 


Number of Times 
PMC is Lower 
than That of 
Estimator S 
(max ■ 30) 


S 

.4032712 

.0014404 

K 

.4074098 

.0021665 

G 

.4083671 

.0016222 

R 

.39 76231 

.0013577 

D 

.4112139 

.0016231 

M 

.3844087 

.0009758 

A 

.3962300 

.0013994 

P 

.4034277 

.0015071 

0 

.4038810 

.0014969 

F 

‘ .3829356 

.0007398 


* 

* 

.0041386 

14 

0050959 

9 

0056481 

18 

0079427 

8 

0138625 

20 

0070412 

18 

0001565 

13 

0006098 

15 

0203356 

20 
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Table 50 


. Compariaon of Probabilities of Mlsclasslf Icatioo for 
Several Dlscrlainant Functions, 30 Replications, 
iriien 0 ■ 1.0 (OPT ■ .3085), Orientation 2, 
Katrix 4, n ■ 10 


Estimator 

Mean 

PMC 

Variance 

Improvement Over 
Estimator S 

Humber of Times 
PMC is Lower 
than That of 
Estimator S 
(max ■ 30) 

S 

.4353760 

.0015352 

* 

* 

K 

.4296659 

.0160573 

.0057101 

15 

G 

.4149252 

.0013206 

.0204508 

23 

R 

.4092804 

.0012022 

.0260956 

21 

D 

.4271838 

.0023609 

.0081922 

15 

M 

.4087731 

.0012207 

.0266029 

21 

A 

.4099878 

.0011568 

.0253882 

21. 

P 

.4086955 

.0010835 

.0266805 

21 

0 

.4242760 

.0016659 

.0111000 

19 

F 

.4105393 

.0015412 

.0248367 

22 


Table 51 

CoDparison of Probabilities of Mlsclasslfication for 
Several Discriminant Functions, 30 Replications, 
vhen D - 1.0 (OPT ■ .3085), Orientation 2, 




Matrix 4, 

n - 25 


Estimator 

^ Mean 

PMC 

Variance 

Improvement Over 
Estimator S 

Number of Times 
PMC is Lower 
than That of 
Estimator S 
(max - 30) 

S 

.4032658 

.0014404 

* 

e 

K 

.4001522 

.0016495 

.0031135 

23 

G 

.4017023 

.0013862 

.0015634 

16 

R 

.3936941 

.0011592 

.0095717 

19 

D 

.4109952 

.0016423 

-.0077294 

11 

M 

.3823102 

.0007632 

.0209556 

20 

A 

.3948253 

.0013578 

.0084404 

15 

P 

.3999439 

.0013950 

.0033219 

15 

0 

.3996254 

.0013405 

.0036404 

17 

F 

. .3848586 

.0007157 

.0134072 

21 
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! 

! 1 . Table S2 

1 

1 4 Comparison of Probabilities of Misclassif ication for 

; I Several Discriminant Functions, 30 Replications, 

i vhen D ■ 3.0 (OPT ■ .0668), Orientation 2, 

' Matrix 3, n ■ 10 


Estimate 

Mean 

PMC 

Variance 

Improvement Over 
Estimator S 

Number of Times 
PMC is Lower 
than That of 
Estimator S 
(max ■ 30) 

S 

.2495356 

.0060531 

* 

* 

K 

.1154327 

.0070086 

.1341029 

29 

G 

.1572698 

.0022482 

.0922657 

30 

R 

.1462953 

.0010922 

.103 2402 

27 

D 

.1921527 

.0029914 

.0573829 

21 

M 

.1457720 

.0010216 

.1037635 

27 

A 

.1444945 

.0014105 

.1050411 

28 

P 

.1430998 

.0012529 

.1064358 

27 

0 

.1831795 

.0035418 

.0663560 

30 

. ¥ 

.1366920 

.0006592 

.1128435 

28 


f 

I 

I 

{ . 

Table 53 


Comparison of Probabilities of Misclasslfication for 
Several Discriminant Functions, 30 Replications, 
'irfien D ■ 3.0 (OPT ■ .0668), Orientation 2, 




Matrix 3, 

n - 25 


Estimate 

Mean 

PMC 

Variance 

Improvement Over 
Estimator S 

Nusber of Times 
PMC is Lower 
than That of 
Estimator S 
(max ■ 30) 

S 

.1331344 

.0019566 

* 

* 

K 

.1201515 

.0014764 

.0130329 

22 

G 

.1360312 

.0014838 

-.0028968 

14 

R 

.1158102 

.0005663 

.0173742 

22 

D 

.1393344 

.0017496 

-.0066500 

9 

M 

.1193309 

.0002947 

.0223536 

18 

A 

.1112350 

.0004612 

.0218995 

22 

P 

.1233366 

.0007553 

.0098279 

16 

0 

.1265220 

.0009095 

.0066024 

16 

F 

,.1149391_ 

_ .0002346 

.0152453 

16 





1 


1 
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Table 54 


Comparison of Probablllcles of Misclassificatlon for 
Savaral Dlsertalnant Functions, 30 Replications, 
when D ■ 3.0 (OPT - .0668), Orientation 2, 
Hatrix 4, n ■ 10 


Estimator 

Mean 

PMC 

Variance 

Improvement Over 
Estimator S 

Number of Times 
PMC is Lover 
than That of 
Estimator S 
(max > 30) 

S 

.2495033 

.0060513 

* 

* 

K 

.1080565 

.0063426 

.1414469 

29 

6 

.1494012 

.0019507 

.1001021 

30 

R 

.1444231 

.0008757 

.1050802 

28 

D 

.1854059 

.0030886 

.0640974 

24 

M 

.1454299 

.0008298 

.1040735 

28 

A 

.1384619 

.0013410 

.1110414 

29 

P 

.1371177 

.0011693 

.1123856 

28 

0 

.1760073 

.0032529 

.0734961 

30 

F 

.1349167 

.0008031 

.1145866 

28 


Table 55 

Comparison of Probabilities of Misclassificatlon for 
Several Discriminant Functions, 30 Replications, 
when D ■ 3.0 (OPT - .0668), Orientation 2, 
Matrix 4, n ■ 25 


Estimate 

Mean 

PMC 

Variance 

Improvement Over 
Estimator S 

Number of Times 
PMC is Lower 
than That of 
Estimator S 
(max ■ 30) 

S 

.1331843 

.0019555 

* 

* 

K 

.1190056 

.0015548 

.0141787 

30 

C 

.1246015 

.0012961 

.0085828 

20 

R 

.1081321 

.0003603 

.0250521 

24 

D 

.1373143 

.0013923 

-.0041300 

U 

M 

.1072330 

.0001883 

.0259513 

23 

A 

.1074417 

.0003566 

.0257426 

22 

P 

.1164354 

.0006504 

.0167489 

22 

0 

.1188429 

.0008790 

.0143414 

22 

F 

.1145743 

.0002266 

.0186100 

17 


Table S6 


■Coaparlson of Probabilities of Misclassiflcation for 
Several Discrioinanc Functions, 30 Replications, 
when D ■ .C (OPT ■ .3821), Orientation 3, 
Matrix 1, n ■ 10 


Estimator 

Mean 

PMC 

Variance 

Improvement Over 
Estimator S 

Number of Times 
PMC is Lower 
than That of 
Estimator S 
(max • 30) 

S 

.4715319 

.0011503 

* 

• 

* 

K 

.5323469 

.0096985 

-.0608149 

9 

G 

.4768534 

.0006249 

-.0053214 

13 

R 

.4778623 

.0004963 

-.0063303 

11 

D 

.4800106 

.0005833 

-.0084787 

12 

M 

.4788593 

.0005130 

-.0073274 

11 

A 

.4770617 

.0005008 

-.0055297 

12 

P 

.4771987 

.0004606 

-.0056668 

12 

0 

.4785215 

.0008829 

-.0069895 

12 

F 

.4803006 

.0003943 

-.0087686 

10 


Table 57 

Comparison of Probabilities of Misclassiflcation for 
Several Discriminant Functions, 30 Replications, 
when D > .6 (OPT ■ .3821), Orientation 3, 
Matrix 1, n > 25 


Estimator 

Mean 

PMC 

1 

Variance 

Improvement Over 
Estimator S 

Number of Times 
PMC is Lower 
than That of 
Estimator S 
(max • 30) 

S 

.4644161 

.0008756 

* 

* 

R 

.47: 5874 

.0017452 

-.0132713 

9 

G 

.4800509 

.0011086 

-.0156348 

4 

R 

.4794635 

.0011293 

-.0150474 

7 

D 

.4800156 

.0011013 

-.0155995 

5 

M 

.4778069 

.0010842 

-.0133908 

11 

A 

.4806339 

.0012203 

-.0162178 

7 

P 

.4801776 

.0011342 

-.0157615 

5 

0 

.4796249 

.0011068 

-.0152088 

6 

F 

.4784315 

.0009580 

-.0140154 

9 


L 

I Table 58 


I 

i 

I 





■Coaparlsoti of Probabilities of Mlsclasslficatlon for 
Several Discriminant Functions » 30 Replications, 
when D ■ .6 (OPT • .3821), Orientation 3, 
Matrix 2, n ■ 10 


Estimator 

Mean 

PMC 

Variance 

Improvement Over 
Estimator S 

Number of Times 
PMC is Lower 
than That of 
Estimator S 
(max > 30) 

S 

.4720387 

.0012039 

* 

* 

K 

.5150609 

.0119458 

-.0430222 

10 

G 

.4754963 

.0006800 

-.0034576 

13 

R 

.4759637 

.0004033 

-.0039250 

13 

D 

.4827731 

.0006635 

-.0107344 

10 

M 

.4772016 

.0003850 

-.0051629 

U 

A 

.4757041 

.0004880 

-.0036655 

12 

P 

.4759760 

.0004263 

-.0039373 

12 

0 

.4758769 

.0009777 

-.C338382 

12 

F 

.4795544 

.0003982 

-.0075157 

10 


Table 59 

Comparison of Probabilities of Misclassiflcation for 
Several Discriminant Functions, 30 Replications, 
vhen D ■ .6 (OPT ■ .3821), Orientation 3, 
Matrix 2, n •* 25 


Mueber of Times 


Estimator 

Mean 

PMC 

Variance 

Improvement Over 
Estimator S 

PMC is Lower 
than That of 
Estimator S 
(max ■ 30) 

S 

.4642913 

.0008576 

* 

* 

K 

.4687696 

.0010418 

-.0044783 

9 

G 

.4746042 

.0009332 

-.0103129 

4 

R 

.4769748 

.0009503 

-.0126835 

7 

D 

.4802647 

.0010:02 

-.0159734 

4 

M 

.4781760 

.0009360 

-.0138847 

8 

A 

.4785895 

.0010921 

-.0142982 

6 

P 

.4772830 

.0009977 

-.0129917 

4 

0 

.4758248 

.0009466 

-.0115335 

4 

F 

.4784990 

.0008267 

-.0142077 

8 
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Table 60 


[ 




i 


1 

L 


Comparison of Probabilities of Misclassifieation for 
Several Discriminant Functions* 30 Replications, 
vhen D ■ 1.0 (OPT ■ .3085), Orientation 3, 
Hatrix 1, n ■ 10 


Estimate 

Mean 

PMC 

Variance 

Improvement Over 
Estimator S 

Number of Times 
. PMC is Lover 
chan That of 
Estimator S 
(max ■ 30) 

S 

.4338931 

.0019116 

* 

e 

K 

.4460395 

.0190767 

-.0121464 

14 

C 

.4424260 

.0013891 

-.0085330 

14 

R 

.4454960 

.0011540 

-.0116029 

13 

D 

.4513071 

.0013319 

-.0174140 

12 

■M 

.4474095 

.0011760 

-.0135164 

11 

A 

.4416195 

.0011068 

-.0077264 

13 

P 

.4422021 

.0010307 

-.0083090 

13 

0 

.4483870 

.0020470 

-.0144939 

12 

F 

.4489143 

.0009267 

-.0150213 

10 


I. 

i: 

i 

I 

t 

f: 


Table 61 

Comparison of Probabilities of Misclassifieation for 
Several Discriminant Functions, 30 Replications, 
when D ■ 1.0 (OPT - .3085), Orientation 3, 
Matrix 1, n - 25 


Estimator 

Mean 

PMC 

Variance 

Improvement Over 
Estimator S 

Number of Times 
PMC is Lower 
Chan That of 
Estimator S 
(max *• 30) 

S 

.4049063 

.0015699 

* 

* 

K 

.4135915 

.0031345 

-.0086851 

12 

G 

.4385528 

.0024528 

-.0336465 

3 

R 

.4356605 

.0023311 

-.0307541 

7 

D 

.4394411 

.0024522 

-.0345348 

3 

M 

.4325702 

.0020114 

-.0276639 

9 

A 

.4358236 

.0024450 

-.0309173 

8 

P 

.4370903 

.0024293 

-.0321839 

6 

0 

.4367075 

.0023890 

-.0318012 

5 

F 

.4336094 

.0018005 

-.0287031 

a 


i 


1 


'i 
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Table 62 


Comparieon ef Probabilitlas of Mleclaseifleation for 
Savaral Dleerlminanc Funetiona, 30 Raplleationa, 
when D « 1.0 (OPT > .3085), Oriencacion 3, 
Matrix 2, n ■ 10 


Estimator 

Mean 

PMC 

Variance 

Improvement Over 
Estimator 8 

Mtssber of Tines 
PMC is Lover 
than That of 
Estlnator S 
(max - 30) 

S 

.433A909 

.0017418 

* 

• 

* 

K 

.4246007 

.0204109 

.0088902 

IS 

G 

.4383295 

.0014730 

-.0048386 

13 

R 

.4422662 

.0008799 

-.0087753 

14 

D 

.4558897 

.0016159 

-.0223988 

9 

M 

.4445498 

.0008670 

-.0110589 

14 

A 

.4380882 

.0010576 

-.0045973 

12 

P 

.4389660 

.0009383 

-.0054751 

13 

0 

.4418492 

.0021301 

-.0083583 

12 

F 

.4470639 

.0009323 

-.0135730 

10 


Table 63 

Comparison of Probabilities of Mis classification for 
Several Discriminant Functions, 30 Replications, 
%4ien D - 1.0 (OPT - .3085), Orientation 3, 
Matrix 2, n - 25 


Estimator 

Mean 

PMC 

Variance 

InproveMnt Over 
Estimator S 

Munber of Times 
PMC is Lower 
than That of 
Estimator S 
(max ■ 30) 

S 

.4048470 

.0015816 

* 

* 

K 

.4057734 

.0020781 

-.0009264 

15 

G 

.4241128 

.0018660 

-.0192658 

2 

R 

.4283507 

.0018206 

-.0235037 

6 

D 

.4396899 

.0023539 

-.0348429 

3 

M 

.4323980 

.0016739 

-.0275510 

8 

A 

.4313968 

.0021571 

-.0265498 

6 

P 

.4293107 

.0019954 

-.0244637 

4 

0 

.4262551 

.0018580 

-.0214081 

3 

F 

.4335959 

.0015499 

-.0287489 

7 
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T«bl« 64 


Comp«rlaon of Probabllitloa of Mlselasslfleatlon for 
Sovoral Diaerlffllnant Functlona, 30 Rapllcatlona, 
«ih«& 0 ■ 3.0 (OPT - .0668 ) , Oriencation 3, 
Haerlx 1, n * 10 


Batlaacor 


Moan 

PMC 


Varianca 


XaqtrovoBant Ovar 
Eatlaator S 


Mtabar of Tlaaa 
PMC la Lovor 
than That of 
Eatlnator S 
(aax > 30) 


.2333699 

.1291883 

.2123422 

.2232617 

.2628658 

.2232484 

.2059766 

.2073982 

.2322150 

.2126169 


.0047396 

.0092560 

.0019660 

.0022461 

.0035742 

.0021650 

.0018948 

.0019220 

.0025430 

.0020385 


.1041816 

.0210277 

.0101082 

-.0294959 

.0081215 

.0273933 

.0259717 

.0011549 

.0207530 


Tabla 65 

CoBpariaon of Probablllciaa of HlsclaaslficaCion for 
Savaral Discrlmlnane Functlona, 30 Rapllcatlokia, 

. when 0 ■ 3.0 (OPT - .0668), Orientation 3, 
Matrix 1, n ■ 25 


Eatlaator 


Mean 

PMC 


.1315218 

.1219341 

.1817822 

.1673601 

.1869159 

.1621310 

.1597117 

.1709309 

.1740671 

.1620265 


Variance 


.0018469 

.0014079 

.0015730 

.0006437 

.0018902 

.0003605 

.0004277 

.0008129 

.0010193 

.0004422 


ZaproveiDcnt Over 
Eatlaator S 


.0095876 

-.0502604 

-.0358384 

-.0553941 

-.0306092 

-.0281900 

-.0394091 

-.0425453 

-.0305047 


Muaber of Tlaea 
PMC la Lower 
than That of 
Eatlaator S 
(wax ■ 30) 


19 
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T«bl« 66 


.Cemp«rlson of Prcbabilltlos of MiselMslfleacion for 
Sovoral Olocrlaiaaac Fuaetieno, 30 Roplleotioiui, 
«h«& D ■ 3.0 (OPT ■ .0668) » Orlontaclon 3» 
Matrix 2, a ■ 10 


Eatiaator 

Maaa 

PMC 

Varlaaea 

bprovaaaat Ovar 
Eatiaator S 

MkiBbar of Tiaaa 
PMC la Lowar 
thaa That of 
Eatiaator 8 
(aax ■ 30) 

8 

.2243574 

.0036016 

a 

• 

a 

K 

.1037136 

.0030641 

.1206438 

30 

C 

.1986870 

.0020313 

.0256704 

21 

R 

.2187200 

.0018344 

.0056374 

16 

D 

.2570681 

.0046414 

-.0327107 

11 

M 

.2216029 

.0018980 

.0027544 

16 

A 

.1950727 

.0018596 

.0292847 

21 

P 

.1976837 

.0016971 

.0266737 

20 

0 

.2131977 

.0025543 

.0111597 

16 

F 

.2052253 

.0022348 

.0191321 

20 


Tabla 67 

Coaparlson of ProbabiUclos of Misclaaslflcatloa for 
Savaral Dlacrlalaaat Fuaetloas, 30 Raplleatioaa, 

. whoa D • 3.0 (OPT - .0668) , Orlaatatloa 3, 
Matrix 2, a ■ 25 


Eatiaator 

Main 

FMC 

▼ariaaca 

Xaprovaaaat Ovar 
Eatiaator S 

Minbar of Tiaaa 
PMC la Lowar 
than That of 
Eatiaator S 
(aax ■ 30) 

S 

.1319591 

.0021237 

a 

a 

K 

.1194863 

.0018197 

.0124728 

22 

C 

.1545014 

.0017565 

-.0225423 

2 

R 

.1493955 

.0006182 

-.0174364 

6 

D 

.1871879 

.0015492 

-.0552288 

2 

M 

.1579318 

.0005546 

-.0259727 

5 

A 

.1515277 

.0003614 

-.0195686 

6 

P 

.1536625 

.0007857 

-.0217034 

4 

0 

.1516813 

.0010668 

-.0197222 

4 

F 

.1585516 

.0004067 

-.0265925 

6 
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Tabu 68 


. Coaiparlaon of Probabilltlaa of Miaelaaaif lea cion for 
Savaral OiaerlBinanc PUaetloaOi 30 Raplieaeioaa, 
uhan D ■ .6 (OPT ■ .3821), Onancacion 3, 
Hacrls 3, n ■ 10 


latlaator 

Moan 

PMC 

Tarianea 

Xaprovaaeae Over 
Eatiaacer 8 

liiabar of Tlaaa 
PMC la Lower 
than That of 
Bstlaator S 
(■ax - 30) 

8 

.4717440 

.00U812 

* 

* 

K 

.S12623S 

.0091900 

-.0408795 

9 

C 

.4714682 

.0007407 

.0002758 

11 

t 

.4720959 

.0007014 

-.0003519 

13 

0 

.4769108 

.0008939 

-.0051668 

12 

M 

.4736923 

.0007007 

-.0019483 

13 

A 

.4716619 

.0006418 

.0000821 

13 

P 

.4724154 

.0006048 

-.0006714 

13 

0 

.4727913 

.0009229 

-.0010473 

14 

P 

.4763422 

.0005204 

-.0045982 

U 


Table 69 

CoBparison of Probabilities of Mlaclassifleatloa for 
Several Olscrlalnant Funccicns, 30 Baplicacloas, 
irtien D ■ .6 (OPT - .3821), OrUntatlon 3, 
Matrix 3, A • 25 

Satlaator 

Mean 

PMC 

Variance 

Xapreveaent Over 
Eatlaator 8 

Mueber of Tiaea 
PMC la Lower 
than That of 
Satlaator 8 
(aax • 30) 

S 

.4652408 

.0009935 

* 

* 

K 

.4761015 

.0019887 

-.0108607 

5 

C 

.4726285 

.0012053 

-.0073877 

8 

R 

.4708577 

.0012061 

-.0056170 

13 

D 

.4729653 

.0011982 

-.0077246 

8 

M 

.4685765 

.0011787 

-.0033357 

14 

A 

.4713554 

.0012452 

-.0061146 

12 

P 

.4719655 

.0012121 

-.0067248 

11 

0 

.4718916 

.0011963 

-.00665C8 

11 

P 

.4708172 

.0010039 

-.0035765 

12 
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T«bU 70 


Coaparlton of Probobillcloo of MiteUsoif Ication for 
8«v«r«l Diicriaimme Funetloof, 30 Soplleaclons, 
«fh«n D • .6 (OFT • .3821), Orltntaeion 3, 
Matrix 4, a • 10 


taciaacor 

Mt£B 

PMC 

Parlaaeo 

laprovuiaac Ovar 
Satlaacor 8 

Rti^r of Ttaaa 
PMC la Lovor 
thaa That of 
latlaaeor 8 
(aax • 30) 

8 

.4720831 

.0012788 

* 

* 

K 

.3031372 

.0127667 

-.0310341 

9 

G 

.4700337 

.0007420 

.0020494 

16 

R 

.4694306 

.0003764 

.0026324 

16 

D 

.4772137 

.0009871 

-.0031306 

11 

M 

.4703986 

.0003446 

.0014843 

13 

A 

.4701204 

.0006098 

.0019627 

14 

P 

.4707832 

.0003377 

.0012999 

14 

0 

.4708626 

.0010037 

.0012203 

17 

P 

.4732869 

.0003162 

-.0032038 

13 


Tabla 71 

CoBpariaoa of Probabllitlas of Miaclaaalflcaeioa for 
8av«ral DlscriBlnant PunetiOBa, 30 Baplicatloaa, 
whoa 0 • .6 (OPT • .3821), Orlaataeloa 3, 
Matrix 4, a ■ 25 

Eaciaator 

Maaa 

PMC 

Yarianca 

laprovaaaat Ovar 
Eatiaator 8 

Ruabar of Tiauia 
PMC ia Lowar 
thaa That of 
Paclaacor S 
v&ax • 30) 

8 


.4651296 

.0009873 

* 

* 

X 


.4671420 

.0012007 

-.0020124 

9 

G 


.4673454 

.0010232 

-.0022158 

9 

X 


.4674717 

.0010639 

-.0023421 

13 

0 


.4728941 

.0011811 

-.0077645 

9 

M 


.4674353 

.0010331 

-.0023057 

12 

A 


.4697105 

.0011609 

-.0045809 

13 

P 

r 

.4691317 

.0010917 

-.0040021 

9 

0 

.4679627 

.0010369 

-.0028331 

11 

f 


.4710952 

.0008720 

-.0059656 

12 


4 
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TabU 72 


Comparison of Probabllltlos of Mlselassiflcacion for 
Savaral Discriminant Functions, 30 Replications, 
whan D ■ 1.0 (07T ■ .3083), Orientation 3, 
Matrix 3, n ■■ 10 


Betlmator 

Mean 

PMC 

Variance 

Improvement Over 
Estimator S 

NuoRier of Times 
FMC is Lover 
tban That of 
Estimator S 
(max • 30) 

8 

.4353038 

.0021210 

* 

* 

K 

.4361354 

.0139366 

-.0008317 

13 

6 

.4290623 

.0014621 

.0062413 

16 

R 

.4304039 

.0013238 

.0048978 

16 

D 

.4418043 

.0016709 

-.0065006 

13 

M 

.4331626 

.0013629 

.0021412 

15 

A 

.4286235 

.0013581 

.0066803 

13 

P 

.4301593 

.0013361 

.0051445 

13 

0 

.4338156 

.0018453 

.0014882 

16 

F 

.4384861 

.0011986 

-.0031823 

13 


TabU 73 


Comparison of Probabilities of Misclassification for 
Several Discriminant Functions, 30 Replications, 
i4ien D ■ 1.0 (OPT “ .3085), Orientation 3, 
Matrix 3, n > 25 


Number of Times 


Estimator 

Mean 

PMC 

Variance 

Improvement Over 
Estimator S 

P!R: is Lower 
than That of 
Estimator S 
(max • 30) 

S 

.4055966 

.0018138 

* 

* 

K 

.4148539 

.0031281 

-.0092573 

9 

C 

.4220622 

.0024874 

-.0164655 

7 

R 

.4156192 

.0021668 

-.0100226 

10 

D 

.4241355 

.0024947 

-.0185389 

6 

M 

.4110012 

.0017447 

-.0054046 

14 

A 

.4144310 

.0021750 

-.0088343 

10 

P 

.4187178 

.0023542 

-.0131212 

9 

0 

.4193926 

.0023535 

-.0137960 

9 

F 

.4145477 

.0015400 

-.0089510 

11 


Vibl« 7A 


• Ceapariton of PrObobllltloo of Miaeloooifleotioii for 
Sovorol Dtoeriainonc F«metloni» 30 BoplteottoiM. 
ohm D ■ 1.0 (OPT ■ .30SS)( Orioneocloa 3» 
Kotrix 4, n • 10 


Eotlmtor 

Noon 

PNC 

Tortmeo 

XoiprovoMtte Ovor 
Eselmcor S 

NWbor of Hms 
PMC is loNor 
thon That of 
Eatlaator 8 
(aax ■ 30) 

8 

.4342608 

.0019456 

* . 

a 

K 

.4233397 

.0179305 

.0109212 

IS 

0 

.4245597 

.0013671 

.0097011 

16 

R 

.4270403 

.0012926 

.0072205 

15 

D 

.4426120 

.0021714 

-.0083512 

11 

M 

.4290106 

.0012626 

.0052502 

15 

A 

.4245191 

.0012159 

.0097417 

17 

P 

.4260354 

.0011725 

.0082254 

IS 

0 

.4277009 

.0017663 

.0065600 

17 

P 

.4361893 

.OOU949 

-.0019285 

15 


Tablo 75 

Cooporloon of Probobllltioa of Mlsclualfleotloa for 
Sovorol Ol^rialnont Functions. 30 RopUeotlons. 
ohon D - 1.0 (OPT ■ .3085), Orlontoclon 3, 
Motrlz 4. n • 25 


Ruabor of Timos 


Bselaotor 

Mom 

PMC 

Varlmco 

XaprovoBont Ovor 
EatiBoeor 8 

PMC is Lovar 
than That of 
Bstlaator 8 
(nax - 30) 

8 

.4054626 

.0018726 

* 

a 

K 

.4031859 

.0021245 

.0022766 

22 

6 

.4085762 

.0018364 

-.0031136 

11 

R 

.4078334 

.0017050 

-.0023709 

12 

D 

.4238338 

.0024232 

-.018371*1 

7 

M 

.4078858 

.0014149 

-.0024232 

12 

A 

.4109159 

.0019538 

-.0054533 

11 

P 

.4U4915 

.0019335 

-.0060289 

U 

0 

.4093732 

.0018212 

-.0039107 

11 

P 

.4149809 

• 

.0013582 

-.0095183 

U 


94 


T«bU 76 

OoBpartaon of Frobabtlltlaa of.MlaeUaaieication for 
Savaral Oiaerlainaac Fimetloaa, 30 SapUcatteas» 
vhaa D * 3.0 (OPT ■ .0668), Ortaacatioa 3, 
Kacrtx 3, n •> 10 


BatiMtor 


Kaan 

PMC 


.2293197 

.U224A3 

.1767313 

.1777807 

.2333672 

.1784349 

.1693173 

.1702761 

.1975339 

.1735133 


▼arlanea 


.0037984 

.0044693 

.0018591 

.0018803 

.0037738 

.0018331 

.0017083 

.0018428 

.0024048 

.0018499 


Xaprovanaat Ovar 
Batlaatot S 


.U70754 

.0323884 

.0313390 

-.0040473 

.0308848 

.0600024 

.0390436 

.0317638 

.0338064 


Tabla 77 

Cooparlson of Probabllitlas of Mlselasalficatlon for 
Savaral Diacriainant Functlona, 30 Raplleaciona, 
vhan D • 3.0 (OPT - .0G68), Orleacaeion 3, 
Mat^ 3, a " 23 


Batlaacor 


Maan 

PMC 


Tarlanca 


laprovaaant Ovar 
Eatimacor S 


Nuabar of Tlaaa 
PMC la Lowar 
than That of 
Eatiaator S 
(aax > 30) 


.1304627 

.1213906 

.1496848 

.1308800 

.1341308 

.1231524 

.1243820 

.1367060 

.1409471 

.1250879 


.0017817 

.0017354 

.0013936 

.0003221 

.0019616 

.0003101 

.0002787 

.0006833 

.0009438 

.0002812 


.0090721 

-.0192221 

-.0004173 

-.0236681 

.0053103 

.0060807 

-.0062433 

-.0104844 

.0053748 


5 



Tabu 78 


•Coaparlaeii of ProbabilitUa of MlaeUsalfieatlon for 
Sovoral DlaerlBlaaiic Punetiooa» 30 RopUeations, 
ohoo D ■ 3.0 (OPT ■ .0668) • Orlontotlon 3« 
Matrix 4, n • 10 






MiBKbar of Tlaaa 

laclaator 

bbaa 

PMC 

Farlaneo 

Uptovoaoat Ovor 
Estimator 8 

PNC la Lowor 
thtti That of 
Eatlaator 8 





(max « 30) 

8 

.2330261 

.0048248 

* 

• 

o 

K 

.0975499 

.0027006 

.1354763 

30 

G 

.1639241 

.0016791 

.0691020 

29 

R 

.1730693 

.0017608 

.0599568 

24 

D 

.2249789 

.0035917 

.0080473 

17 

M 

.1760017 

.0017952 

.0570244 

25 

A 

.1585676 

.0016115 

.0744585 

25 

P 

.1598563 

.0016097 

.0731699 

24 

0 

.1808607 

.0025917 

.0521654 

28 

P 

.1669733 

.0020316 

.0660529 

23 

• 


Tablo 79 


Comparison of Probabllltlss of Misclasslfleation for 


Sovoral Dlscrlninanc Functions. 30 Ropllcatlons. 


Oban 0 • 

3.0 (OPT - . 

0668), Orientation 3, 




Matrix 4 

• n • 10 






Ruabor of TUos 

EselBator 

Moan 

PMC 

Varlaneo 

Zaprovoaont Ovor 
Estlaator S 

PMC la Lowor 
than That of 




Estlaator 8 
(aax * 30) 


S 

.1302982 

.0021 U3 

* 

* 


K 

G 

R 

D 

M 

A 

P 

0 

P 


.U47942 

.1259162 

.1161704 

.1509116 

.1197156 

.1176184 

.1225194 

.1225293 

.1223729 


.0017454 

.0013530 

.0003746 

.0014984 

.0002596 

.0002667 

.0006400 

.0009248 

.0002662 


.0155040 

.0043820 

.0141278 

-.0206134 

.0105826 

.0126798 

.0077788 

.0077689 

.0079253 


29 

15 
18 

4 

13 

16 
15 
15 

14 
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